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(57) Abstract 

A method for the rational design 
and preparation of vaccines based on HIV 
envelope polypeptides is described. In 
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domains of gpl20 of HIV as depicted in 
the figure. In a preferred embodiment of 
the method, neutralizing epitopes for the 
V2, V3 and C4 domains of gpl20 are 
determined. Also described are DNA se- 
quences encoding gpl20 from preferred 
vaccine strains of HIV. 
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HIV ENVELOPE POLYPEPTIDES 

FIELD OF THE INVENTION 
5 This invention relates to the rational design and 

preparation of HIV vaccines based on HIV envelope 
polypeptides and the resultant vaccines. This 
invention further relates to improved methods for HIV 
serotyping and immunogens which induce antibodies 
10 useful in the serotyping methods. 

BACKGROUND OF THE INVENTION 

Acquired immunodeficiency syndrome (AIDS) is 
caused by a retrovirus identified as the human 

15 immunodeficiency virus (HIV) . There have been intense 
effort to develop a vaccine. These efforts have 
focused on inducing antibodies to the HIV envelope 
protein. Recent efforts have used subunit vaccines 
where an HIV protein, rather than attenuated or killed 

20 virus, is used as the immunogen in the vaccine for 
safety reasons. Subunit vaccines generally include 
gpl2 0, the portion of the HIV envelope protein which is 
on the surface of the virus. 

The HIV envelope protein has been extensively 

2 5 described, and the amino acid and RNA sequences 

encoding HIV envelope from a number of HIV strains are 
known (Myers, G. et al., 1992. Human Retroviruses and 
AIDS. A compilation and analysis of nucleic acid and 
amino acid sequences. Los Alamos National Laboratory, 

3 0 Los Alamos, New Mexico) . The HIV envelope protein is a 

glycoprotein of about 160 kd (gpl60) which is anchored 
in the membrane bilayer at its carboxyl terminal 
region. The N-terminal segment, gpl2 0, protrudes into 
the aqueous environment surrounding the virion and the 
*5 C-terminal segment, gp4l, spans the membrane. Via a 
host-cell mediated process, gpl60 is cleaved to form 
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gpl2 0 and the integral membrane protein gp41. As there 
is no covalent attachment between gpl2 0 and gp41, free 
gpl20 is released from the surface of virions and 
infected cells, 

5 The gpl20 molecule consists of a polypeptide core 

... ^ 
of 60,000 daltons which is extensively modified by 

N-linked glycosylation to increase the apparent 

molecular weight of the molecule to 12 0,000 daltons. 

The amino acid sequence of gpl2 0 contains five 

10 relatively conserved domains interspersed with five 
hypervariable domains. The positions of the 18 
cysteine residues in the gpl2 0 primary sequence, and 
the positions of 13 of the approximately 24 N-linked 
glycosylation sites in the gpl2 0 sequence are common to 

15 all gpl2 0 sequences. The hypervariable domains contain 
extensive amino acid substitutions, insertions and 
deletions. Sequence variations in these domains result 
in up to 30% overall sequence variability between gpl20 
molecules from the various viral isolates. Despite 

2 0 this variation, all gpl2 0 sequences preserve the 

virus's ability to bind to the viral receptor CD4 and 
to interact with gp41 to induce fusion of the viral and 
host cell membranes. 

gpl20 has been the object of intensive 
25 investigation as a vaccine candidate for subunit 

vaccines, as the viral protein which is most likely to 
be accessible to immune attack. gpl20 is considered to 
be a good candidate for a subunit vaccine, because 
(i) gpl2 0 is known to possess the CD4 binding domain by 

3 0 which HIV attaches to its target cells, (ii) HIV 

infect ivity can be neutralized in vitro by antibodies 
to gp 120, (iii) the majority of the in vitro «• 
neutralizing activity present in the serum of HIV 
infected individuals can be removed with a gpl20 •» 
35 affinity column, and (iv) the gpl20/gp41 complex 
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appears to be essential for the transmission of HIV by 
cell-to-cell fusion. 

The identification of epitopes recognized by virus 
. neutralizing antibodies is critical for the rational 
5 design of vaccines effective against HIV-l infection* 
One way in which antibodies would be expected to 
neutralize HIV-l infection is by blocking the binding 
of the HIV-l envelope glycoprotein, gpl20, to its 
cellular receptor, CD4 . However, it has been 

10 surprising that the CD4 blocking activity, readily 

demonstrated in sera from HIV-l infected individuals 
(31, 44) and animals immunized with recombinant 
envelope glycoproteins (1-3) , has not always correlated 
with neutralizing activity (2, 31, 44). Results 

15 obtained with monoclonal antibodies have shown that 

while some of the monoclonal antibodies that block the 
binding of gpl20 to CD4 possess neutralizing activity, 
others do not (4, 7, 16, 26, 33, 35, 43, 45), When the 
neutralizing activity of CD4 blocking monoclonal 

20 antibodies are compared to those directed to the 

principal neutralizing determinant (PND) located in the 
third variable domain (V3 domain) of gpl20 (10, 39), 
the CD4 blocking antibodies appear to be significantly 
less potent. Thus, CD4 blocking monoclonal antibodies 

25 typically exhibit 50% inhibitory concentration values 
(IC 50 ) in the 1-10 iiq/ral range (4, 16, 26, 33, 35, 43, 
45) whereas PND directed monoclonal antibodies 
typically exhibit IC 50 values in the 0.1 to 1.0 nq/ml 
range (23, 33 f 42). 

3 0 Subunit vaccines, based on gp!20 or another viral 

protein, that can effectively induce antibodies that 
neutralize HIV are still being sought. However, to 
date no vaccine has not been effective in conferring 
protection against HIV infection. 

35 
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DESCRIPTION OF THE BACKGROUND ART 

Recombinant subunit vaccines are described in 
Berman et al . , PCT/US91/02250 (published as number 
W091/15238 on 17 October 1991) • See also, e.g. Hu et 
5 al., Nature 328:721-724 (1987) (vaccinia virus-HIV 

envelope recombinant vaccine); Arthur et al . , J . Virol. 
63(12): 5046-5053 (1989) (purified gpl20) ; and Berman 
et al., Proc. Natl. Acad. Sci . USA 85:5200-5204 (1988) 
(recombinant envelope glycoprotein gpl20) . 

10 Numerous sequences for gpl2 0 are known. The 

sequence of gpl20 from the IIIB substrain of HIV-1^ 
referred to herein is that determined by Muesing et 
al . , "Nucleic acid structure and expression of the 
human AIDS/lymphadenopathy retrovirus, Nature 

15 313:450-458 (1985). The sequences of gp!20 from the 
NY-5, Jrcsf, Z6, Z321, and HXB2 strains of HIV-1 are 
listed by Myers et al . , "Human Retroviruses and AIDS; A 
compilation and analysis of nucleic acid and amino acid 
sequences," Los Alamos National Laboratory, Los Alamos, 

20 New Mexico (1992). The sequence of the Thai isolate 

A244 is provided by McCutchan et al . , "Genetic Variants 
of HIV- 1 in Thailand," AIDS Res. and Human Retroviruses 
8:1887-1895 (1992). The MN 19M clone is described by 
Gurgo et al . , "Envelope sequences of two new United 

25 States HIV-1 isolates," Virol. 164: 531-536 (1988). 
The amino acid sequence of this MN clone differs by 
approximately 2% from the MN-gpl20 clone (MNq^) 
disclosed herein and obtained by Berman et al . 
Each of the above-described references is 

3 0 incorporated herein by reference in its entirety. 

SUMMARY OF THE INVENTION 

The present invention provides a method for the 
rational design and preparation of vaccines based on 
3 5 HIV envelope polypeptides. This invention is based on 
the discovery that there are neutralizing epitopes in 
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the V2 and C4 domains of gpl2 0, in addition to the 
neutralizing epitopes in the V3 domain. In addition, 
the amount of variation of the neutralizing epitopes is 
highly constrained, facilitating the design of an HIV 
5 subunit vaccine that can induce antibodies that 
neutralize a plurality of HIV strains for a given 
geographic region. 

In one embodiment, the present invention provides 
a method for making an HIV gpl20 subunit vaccine for a 

10 geographic region in which a neutralizing epitope in 

the V2 and/or C4 domains of gp!2 0 of HIV isolates from 
the geographic region is determined and an HIV strain 
having gpl20 which has a neutralizing epitope in the V2 
or C4 domain which is common among isolates in the 

15 geographic region is selected and used to make the 
vaccine. 

In a preferred embodiment of the method, * 
neutralizing epitopes for the V2, V3, and C4 domains of 
gpl2 0 from HIV isolates from the geographic region are 

20 determined. At least two HIV isolates having different 
neutralizing epitopes in the V2 , V3 , or C4 domain are 
selected and used to make the HIV gpl2 0 subunit 
vaccine. Preferably, each of the selected isolates 
have one of the most common neutralizing epitopes for 

25 the V2, V3, or C4 domains. 

The invention also provides a multivalent HIV 
gpl20 subunit vaccine. The vaccine comprises gpl20 
from two isolates of HIV having at least one different 
neutralizing epitope. Preferably, the isolates have 

30 the most common neutralizing epitopes in the geographic 
region for one of the domains. 

A DNA sequence of less than 5 kilobases encoding 
gpl20 from preferred vaccine strains of HIV, GNE 8 and 
GNE 16 , expression construct comprising the GNE 8 -gpl20 and 

3 5 GNE l6 -gpl20 encoding DNA under the transcriptional and 
translational control of a heterologous promoter, and 
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isolated GNE 8 -gpl20 and GNE, 6 -gpl20 are also provided. 
The invention further provides improved methods for HIV 
serotyping in which epitopes in the V2 or C4 domains of 
gpl2 0 are determined and provides immunogens (truncated 
5 gpl20 sequences) which induce antibodies useful in the 
serotyping methods. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 describes inhibition of CD4 binding by 

10 monoclonal antibodies to recombinantly produced gpl2 0 
from the MN strain of HIV (MN-rgpl20) . Mice were 
immunized with MN-rgpl20 and the resulting splenocytes 
were fused with the NP3X63 . Ag8 . 653 cell line as 
described in Example 1. Thirty-five stable hybridoma 

15 clones, reactive with MN-rgpl20 were identified by 

ELISA. Secondary screening revealed seven cell lines 
(1024, 1093, 1096, 1097, 1110, 1112, and 1027) 
secreting antibodies able to inhibit the binding of 
MN-rgpl20 to biotin labeled recombinantly produced CD4 

20 (rsCD4) in a ELISA using HRPO-strepavadin . Data 
obtained with monoclonal antibodies from the same 
fusion (1026, 1092, 1126) that failed to inhibit 
MN-rgpl20 binding to CD4 is shown for purposes of 
comparison. 

25 FIGURE 2 shows neutralizing activity of CD4- 

blocking monoclonal antibodies to MN-rgpl20. 
Monoclonal antibodies that blocked the binding of 
MN-rgpl20 to CD4 were screened for the capacity to 
inhibit the infection of MT2 cells by the MN strain of 

30 HIV-1 in vitro. Cell free virus was added to wells 

containing serially diluted antibodies and incubated at 
4°C for 1 hr. After incubation, MT-2 cells were added 
to the wells and the cultures were then grown for 5 
days at 37 °C. Cell viability was then measured by 

35 addition of the color imetric tetrazolium compound MTT 
as described in reference (35) of Example 1. The 
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optical densities of each well were measured at 540 run 
using a microtiter plate reading spectrophotometer. 
Inhibition of virus infectivity was calculated by 
dividing the mean optical densities from wells 
5 containing monoclonal antibodies by the mean value of 

it 

wells that received virus alone. Monoclonal antibodies 
that blocked CD4 binding are the same as those 
indicated in Figure Legend 1. Data from the V3~ 
directed monoclonal antibody to MN-rgpl20 (1034) is 
10 provided as a positive control. Data obtained with the 
V3 directed monoclonal antibody, 11G5, specific for the 
IIIB strain of HIV-l (33) is shown as a negative 
control. 

FIGURE 3 is a diagram of gpl20 fragments used to 

15 localize the epitopes recognized by the CD4 blocking 
monoclonal antibodies to MN-rgpl20. A series of 
fragments (A) corresponding to the V4 and C4 domains 
(B) (SEQ. ID. NO. 14) of the gene encoding MN-rgpl20 
were prepared by PCR. The gpl20 gene fragments were 

2 0 fused to a fragment of the gene encoding Herpes Simplex 
Virus Type 1 glycoprotein D that encoded the signal 
sequence and 25 amino acids from the mature amino 
terminus. The chimeric genes were assembled into a 
mammalian cell expression vector (PRK5) that provided a 

2 5 CMV promoter, trans lational stop codons and an SV4 0 
polyadenylation site. The embryonic human kidney 
adenocarcinoma cell line, 293s, was transf ected with 
the resulting plasmid and recombinant proteins were 
recovered from growth conditioned cell culture medium. 

30 Fragments of MN-rgpl20, expressed as HSV-l Gd fusion 
proteins, were produced by transient transf ection of 
293s cells (Example 1) . To verify expression, cells 
were metabolically labeled with [ 35 S] -methionine , and 
* the resulting growth conditioned cell culture 

35 supernatants were immunoprecipitated (C) using a 
monoclonal antibody, 5B6, specific for the amino 
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terminus of HSV-1 Gd and fixed S. aureus. The 
immunoprecipitated proteins were resolved on 4 to 20 % 
acrylamide gradient gels using SDS-PAGE and visualized 
by autoradiography. The samples were: Lane 1, 
5 FMN. 368-408; lane 2, FMN. 368-451; lane 3, FMN. 419-443; 
lane 4, FMN. 414-451; lane 5, MN-rgpl20. The gel 
demonstrated that the proteins were expressed and 
migrated at the expected molecular weights. 

FIGURE 4 shows a C4 domain sequence comparison 

10 (SEQ. ID. Nos. 3-13) . The C4 domain amino acid 

sequences of recombinant and virus derived gpl20s used 
for monoclonal antibody binding studies were aligned 
starting the amino terminal cysteine. Amino acid 
positions are designated with respect to the sequence 

15 of MN-rgpl20. Sequences of the LAI substrains, IIIB, 
BH10, Bru, HXB2, and HXB3 are shown for purposes of 
comparison. 

FIGURE 5 shows sequences of C4 domain mutants of 
MN-rgpl2 0 (SEQ. ID. Nos. 3 and 15-23) . Nucleotide 

20 substitutions, resulting in the amino acid sequences 
indicated, were introduced into the C4 domain of 
MN-rgpl2 0 gene using recombinant PGR. The resulting 
variants were assembled into the expression plasmid, 
pRK5, which was then transfected into 293s cells. The 

25 binding of monoclonal antibodies to the resulting C4 
domain variants was then analyzed (Table 5) by ELISA. 

FIGURE 6 illustrates the reactivity of monoclonal 
antibody 1024 with HIV-1^, substrains. The cell 
surface binding of the C4 domain reactive monoclonal 

30 antibody 1024 to H9 cells chronically infected with the 
IIIB, HXB2 , HXB3, and HXB10 substrains of HIV-1 LAI or 
HIV-1MN was analyzed by flow cytometry. Cultures of 
virus infected cells were reacted with either 

monoclonal antibody 1024, a nonrelevant monoclonal * 
35 antibody (control) , or a broadly cross reactive 

monoclonal antibody (1026) raised against rgpl20. 
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After washing away unbound monoclonal antibody, the 
cells were then labeled with fluorescein conjugated 
goat antibody to mouse IgG (Fab') 2 , washed and fixed 
with paraformaldehyde. The resulting cells were 
5 analyzed for degree of fluorescence intensity using a 
FACSCAN (Becton Dickenson, Fullerton, CA) . 
Fluorescence was measured as mean intensity of the 
cells expressed as mean channel number plotted on a log 
scale. 

10 FIGURE 7 shows the determination of the binding 

affinity of monoclonal antibodies for MN-rgpl20. CD4 
blocking monoclonal antibodies raised against MN-rgpl2 0 
(1024 and 1097) or IIIB-rgpl20 (13H8 and 5C2) were ■ 
labeled with [ ,25 I] and binding titrations using 

15 MN-rgpl2 0 (A and B) or IIIB-rgpl2 0 (C and D) were 

carried out as described in the Example 1. A, binding 
of monoclonal antibody 1024; B binding of monoclonal 
antibody 1097; C, binding of monoclonal antibody 13H8; 
and D binding of monoclonal antibody 5C2 . 

2 0 FIGURE 8 shows the correlation between gpl20 

binding affinity (K d ) and neutralizing activity (IC50) 
of monoclonal antibodies to the C4 domain of MN-rgpl20. 
Binding affinities of monoclonal antibodies to the C4 ' 
domain of gpl20 were determined by Scatchard analysis 

25 (Figure 9, Table 5). The resulting values were plotted 
as a function of the log of their neutralizing 
activities (IC 50 ) determined in Figure 2 and Table 6. 
. FIGURE 9 depicts the amino acid sequence of the 
mature envelope glycoprotein (gpl20) from the MNq,^ 

30 clone of the MN strain of HIV-1 (SEQ. ID. NO. 1) . 

Hypervariable domains are from 1-29 (signal sequence) , 
131-156, 166-200,305-332, 399-413, and 460-469. The V 
and C regions are indicated (according to Modrow et 
al., J. Virology 61(2) :570 (1987). Potential 

35 glycosylation sites are marked with a (*) . 

-9- 



BNSDOCID: <WO 9428929A1_L> 



WO 94/28929 



PCT/US94/06036 



FIGURE 10 depicts the amino acid sequence of a 
fusion protein of the residues 41-511 of the mature 
envelope glycoprotein (gpl20) from the MNq^ clone of 
the MN strain of HIV-1, and the gD-1 amino terminus 
5 from the herpes simplex glycoprotein gD-1. (SEQ. ID. 
NO. 2) . The V and C regions are indicated (according 
to Modrow et al . , «J. Virology 61(2) :570 (1987). 
Potential glycosylation sites are marked with a (*) . 

10 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a method for the 
rational design and preparation of vaccines based on 
- HIV envelope polypeptides. This invention is based on 
the discovery that there are neutralizing epitopes in 

15 the V2 and C4 domains of gpl2 0, in addition to the 

neutralizing epitopes in the V3. domain. Although the 
amino acid sequences of the neutralizing epitopes in 
the V2, V3 , and C4 domains are variable, it has now 
been found that the amount of variation is highly 

20 constrained. The limited amount of variation 

facilitates the design of an HIV subunit vaccine that 
can induce antibodies that neutralize the most common 
HIV strains for a given geographic region. In 
particular, the amino acid sequence of neutralizing 

25 epitopes in the V2 , V3, and C4 domains for isolates of 
a selected geographic region is determined. gpl20 from 
isolates having the most common neutralizing epitope 
sequences are utilized in the vaccine. 

The invention also provides a multivalent gpl20 

3 0 subunit vaccine wherein gpl2 0 present in the vaccine is 
from at least two HIV isolates which have different 
amino acid sequences for a neutralizing epitope in the * 
V2, V3, or C4 domain of gpl20. The invention further 
provides improved methods for HIV serotyping in which 

35 epitopes in the V2 or C4 domains of gpl20 are 

-10- 



BNSOOCID: <WO 9428929A1 _!_> 



WO 94/28929 



PCT/US94/06036 



determined and provides immunogens which induce 
antibodies useful in the serotyping methods . 

The term "subunit vaccine" is used herein, as in 
the art, to refer to a viral vaccine that does not 
5 contain virus, but rather contains one or more viral 
proteins or fragments of viral proteins* As used 
herein, the term "multivalent" means that the vaccine 
contains gpl2 0 from at least two HIV isolates having 
different amino acid sequences for a neutralizing 
10 epitope. 

Vaccine Design Method 

The vaccine design method of this invention is 
based on the discovery that there are neutralizing 
15 epitopes in the V2 and C4 domains of gpl20, in addition 
to those found in the principal neutralizing domain 
(PND) in the V3 domain. Selecting an HIV isolate with , 
appropriate neutralizing epitopes in the V2 and/or C4 
domains provides a vaccine that is designed to induce 

2 0 immunity to the HIV isolates present in a selected 

geographic region. In addition, although the amino 
acid sequence of the V2 , V3 , and C4 domains containing ... 
the neutralizing epitopes is variable, the amount of 
variation is highly constrained, facilitating the 
25 design of a multivalent vaccine which can neutralize a 
plurality of the most common HIV strains for a given 
geographic region. 

The method for making an HIV gp!20 subunit vaccine 
depends on the use of appropriate strains of HIV for a 

3 0 selected geographic region. Appropriate strains of HIV 

for the region are selected by determining the 
neutralizing epitopes for HIV isolates and the 
percentage of HIV infections attributable to each 
strain present in the region. HIV strains which have 
3 5 the most common neutralizing epitopes in the V2 or C4 
domains in the geographic region are selected. 
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Preferably, isolates that confer protection against the 
most common neutralizing epitopes in the V2 , V3 , and C4 
domains for a geographic region are selected. 

One embodiment of the method for making an HIV 
5 gpl2 0 subunit vaccine from appropriate strains of HIV 
for a geographic region comprises the following steps, 
A neutralizing epitope in the V2 or C4 domain of gpl20 
of HIV isolates from the geographic region is 
determined. An HIV strain having gpl2 0 with a 

10 neutralizing epitope in the V2 or C4 domain that is 

common among HIV isolates in the geographic region is 
selected. gpl20 from the selected isolate is used to 
make an HIV gpl20 subunit vaccine. 

In another embodiment of the method, the 

15 neutralizing epitopes in the V2 , V3 , and C4 domains of 
gp!20 from HIV isolates from the geographic region are 
determined. At least two HIV isolates having different 
neutralizing epitopes in the V2 , V3 , or C4 domain are 
selected and used to make an HIV gpl2 0 subunit vaccine. 

2 0 Preferably, the vaccine contains gpl2 0 from at least 
the two or three HIV strains having the most common 
neutralizing epitopes for the V2, V3, or C4 domains. 
More preferably, the vaccine contains gpl20 from 
sufficient strains so that at least about 50%, 

25 preferably about 70%, more preferably about 80% or more 
of the neutralizing epitopes for the V2, V3, and C4 
domains in the geographic region are included in the 
vaccine. The location of the neutralizing epitopes in 
the V3 region are well known. The location of the 

30 neutralizing epitopes in the V2 and C4 regions are 
described hereinafter. 

Each of the steps of the method are described in 
detail below. 

35 Determining neutralizing epitopes 
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The first step in designing a vaccine for a 
selected geographic region is to determine the 
neutralizing epitopes in the gp!20 V2 and/or C4 
domains. In a preferred embodiment , neutralizing 
5 epitopes in the V3 domain (the principal neutralizing 
domain) are also determined. The location of 
neutralizing epitopes in the V3 domain is well known. 
Neutralizing epitopes in the V2 and C4 domains have now 
been found to be located between about residues 163 and 

10 200 and between about residues 420 and 440, 

respectively,. In addition, the critical residues for 
antibody binding are residues 171, 173, 174, 177, 181, 
183, 187, and 188 in the V2 domain and residues 429 and 
4 32 in the C4 domain, as described in detail in the 

15 Examples. 

The neutralizing epitopes for any isolate can be 
determined by sequencing the region of gpl2 0 containing 
the neutralizing epitope. Alternatively, when 
antibodies specific for the neutralizing epitope, 

20 preferably monoclonal antibodies, are available the 

neutralizing epitope can be determined by serological 
methods as described hereinafter. A method for 
identification of additional neutralizing epitopes in 
gpl20 is described hereinafter. 

25 When discussing the amino acid sequences of 

various isolates and strains of HIV, the most common 
numbering system refers to the location of amino acids 
within the gpl20 protein using the initiator methionine 
residue as position 1. The amino acid numbering 

30 reflects the mature HIV-1 gpl20 amino acid sequence as 
shown by Figures 9 and Fig. 10 [SEQ. ID Nos. 1 and 2]. 
For gpl20 sequences derived from other HIV isolates and 
which include their native HIV N-terminal signal 
sequence, numbering may differ. Although the 

35 nucleotide and amino acid residue numbers may not be 

applicabl in other strains where upstr am deletions or 
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insertions change the length of the viral genome and 
gpl2 0 / the region encoding the portions of gpl2 0 is 
readily identified by reference to the teachings 
herein. The variable (V) domains and conserved (C) 
5 domains of gpl20 are specified according to the 
nomenclature of Modrow et al . "Computer-assisted 
analysis of envelope protein sequences of seven human 
immunodeficiency virus isolates: predictions of 
antigenic epitopes in conserved and variable regions," 

10 J. Virol. 61:570-578 (1987). 

The first step in identifying the neutralizing 
epitopes for any region of gpl20 is to immunize an 
animal with gpl20 to induce anti-gpl20 antibodies. The 
antibodies can be polyclonal or, preferably, 

15 monoclonal. Polyclonal antibodies can be induced by 
administering to the host animal an immunogenic 
composition comprising gpl20. Preparation of 
immunogenic compositions of a protein may vary 
depending on the host animal and the protein and is 

2 0 well known. For example, gpl20 or an antigenic portion 

thereof can be conjugated to an immunogenic substance 
such as KLH or BSA or provided in an adjuvant or the 
like. The induced antibodies can be tested to 
determine whether the composition is specific for 
25 gp!20. If a polyclonal antibody composition does not 
provide the desired specificity, the antibodies can be 
fractionated by ion exchange chromatography and 
immunoaf f inity methods using intact gpl20 or various 
fragments of gpl2 0 to enhance specificity by a variety 

3 0 of conventional methods. For example, the composition 

can be fractionated to reduce binding to other 
substances by contacting the composition with gpl20 
affixed to a solid substrate. Those antibodies which 
bind to the substrate are retained. Fractionation 
35 techniques using antigens affixed to a variety of solid 
substrates such as affinity chromatography materials 
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including Sephadex, Sepharose and the like are well 
known. 

Monoclonal anti-gpl2 0 antibodies can be produced 
by a number of conventional methods. A mouse can be 
5 injected with an immunogenic composition containing 
gpl20 and spleen cells obtained. Those spleen cells 
can be fused with a fusion partner to prepare 
hybridomas. Antibodies secreted by the hybridomas can 
be screened to select a hybridoma wherein the 
10 antibodies neutralize HIV infectivity, as described 

hereinafter. Hybridomas that produce antibodies of the 
desired specificity are cultured by standard 
techniques. 

Infected human lymphocytes can be used to prepare 

15 human hybridomas by a number of techniques such as 

fusion with a murine fusion partner or transformation 
with EBV. In addition, combinatorial libraries of 
human or mouse spleen can be expressed in E. coli to 
produce the antibodies. Kits for preparing 

20 combinatorial libraries are commercially available. 

Hybridoma preparation techniques and culture methods 
are well known and constitute no part of the present 
invention. Exemplary preparations of monoclonal 
antibodies are described in the Examples. 

25 Following preparation of anti-gp!20 monoclonal 

antibodies, the antibodies are screened to determine 
those antibodies which are neutralizing antibodies. 
Assays to determine whether a monoclonal antibody 
neutralizes HIV infectivity are well known and are 

3 0 described in the literature. Briefly, dilutions of 

antibody and HIV stock are combined and incubated for a 
time sufficient for antibody binding to the virus. 
Thereafter, cells that are susceptible to HIV infection 
are combined with the virus /antibody mixture and 

3 5 cultured. MT-2 cells or H9 cells are susceptible to 
infection by most HIV strains that are adapted for 
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growth in the laboratory. Activated peripheral blood 
mononuclear cells (PBMCs) or macrophages can be 
infected with primary isolates (isolates from a patient 
specimens which have not been cultured in T-cell lines 
5 or transformed cell lines). Daar et al, Proc. Natl. 

Acad. Sci . USA 87 : 6574-6578 ( 1990) describe methods for 
infecting cells with primary isolates. 

After culturing the cells for about five days, the 
number of viable cells is determined, as by measuring 

10 metabolic conversion of the formazan MTT dye. The 

percentage of inhibition of infectivity is calculated 
to determine those antibodies that neutralize HIV. An 
exemplary preferred procedure for determining HIV 
neutralization is described in the Examples. 

15 Those monoclonal antibodies which neutralize HIV 

are used to map the epitopes to which the antibodies 
bind. To determine the location of a gpl20 
neutralizing epitope, neutralizing antibodies are 
combined with fragments of gp!20 to determine the 

20 fragments to which the antibodies bind. The gp!20 

fragments used to localize the neutralizing epitopes 
are preferably made by recombinant DNA methods as 
described hereinafter and exemplified in the Examples. 
By using a plurality of fragments, each encompassing 

25 different, overlapping portions of gpl20, an amino acid 
sequence encompassing a neutralizing epitope to which a 
neutralizing antibody binds can be determined. A 
preferred exemplary determination of the neutralizing 
epitopes to which a series of neutralizing antibodies 

30 binds is described in detail in the Examples. 

This use of overlapping fragments can narrow the 
location of the epitope to a region of about 2 0 to 4 0 
residues. To confirm the location of the epitope and 
narrow the location to a region of about 5 to 10 

35 residues, site-directed mutagenicity studies are 

preferably performed. Such studi s can also d termine 
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the critical residues for binding of neutralizing 
antibodies- A preferred exemplary site-directed 
mutagenicity procedure is described in the Examples . 
To perform site-directed mutagenicity studies, 
5 recombinant PCR techniques can be utilized to introduce 
single amino acid substitutions at selected sites into 
gpl20 fragments containing the neutralizing epitope. 
Briefly, overlapping portions of the region containing 
the epitope are amplified using primers that 

10 incorporate the desired nucleotide changes • The 

resultant PCR products are annealed and amplified to 
generate the final product. The final product is then 
expressed to produce a mutagenized gpl20 fragment. 
Expression of DNA encoding gpl20 or a portion thereof 

15 is described hereinafter and exemplified in the 
Examples. 

In a preferred embodiment described in Example 1, , 
the gpl2 0 fragments are expressed in mammalian cells 
that are capable of expression of gpl20 fragments 

20 having the same glycolsylation and disulfide bonds as *" 
native gpl20. The presence of proper glycolsylation ^ 
and disulfide bonds provides fragments that are more 
likely to preserve the neutralizing epitopes than 
fragments that are expressed in E. coli, for example, 

25 which lack disulfide bonds and glycosylation or are 
chemically synthesized which lack glycolsylation and 
may lack disulfide bonds. 

Those mutagenized gpl20 fragments are then used in 
an immunoassay using gpl20 as a control to determine 

3 0 the mutations that impair or eliminate binding of the 
neutralizing antibodies. Those critical amino acid 
residues form part of the neutralizing epitope that can 
only be altered in limited ways without eliminating the 
epitope. Each alteration that preserves the epitope 

35 can be determined. Such mutagenicity studies 

demonstrate the variations in the amino acid sequence 
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of the neutralizing epitope that provide equivalent or 
diminished binding by neutralizing antibodies or 
eliminate antibody binding. Although the amino acid 
sequence of gpl20 used in the vaccine preferably is 
5 identical to that of a selected HIV isolate for the 

given geographic region, alterations in the amino acid 
sequence of neutralizing epitope that are suitable for 
use in a vaccine can be determined by such studies . 
Once a neutralizing epitope is localized to a 
10 region of ten to twenty amino acids of gpl2 0, the amino 
acid sequence of corresponding neutralizing epitopes of 
other HIV isolates can be determined by identifying the 
corresponding portion of the gp!2 0 amino acid sequence 
of the isolate. 

15 Once the neutralizing epitopes for a given region 

of gpl2 0 are determined, the amino acid sequence of HIV 
isolates for the geographic region are determined. The 
complete amino acid sequence for numerous isolates has 
been determined and is available from numerous journal 

20 articles and in databases. In such cases, 

determination of the amino acid sequence of HIV 
isolates for the geographic region involves looking up 
the sequence in an appropriate database or journal 
article. However, for some isolates, the amino acid 

25 sequence information does not include the sequence of 
the V2 or C4 domains. 

When the amino acid sequence of a region of 
interest for a given isolate is not known, the amino 
acid sequence can be determined by well known methods. 

3 0 Methods for determining the amino acid sequence of a 
protein or peptide of interest are well known and are 
described in numerous references including Maniatis et 
al., Molecular Cloning — A Laboratory Manual, Cold 
Spring Harbor Laboratory (1984) . In addition, 

35 automated instruments which sequence proteins are 
commercially available. 
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Alternatively, the nucleotide sequence of DNA 
encoding gpl2 0 or a relevant portion of gpl2 0 can be 
determined and the amino acid sequence of gpl2 0 can be 
deduced . Methods for amplifying gpl2 0-encoding DNA 
5 from HIV isolates to provide sufficient DNA for 

sequencing are well known. In particular, Ou et al, 
Science 256:1165-1171 (1992); Zhang et al. AIDS 
5:675-681 (1991); and Wolinsky Science 255:1134-1137 
(1992) describe methods for amplifying gpl20 DNA. 

10 Sequencing of the amplified DNA is well known and is 
described in Maniatis et al., Molecular Cloning — A 
Laboratory Manual, Cold Spring Harbor Laboratory 
(1984), and Horvath et al., An Automated DNA 
Synthesizer Employing Deoxynucleoside 

15 3 1 -Phosphoramidites , Methods in Enzymology 154: 

313-326, (1987) , for example. In addition, automated 
instruments that sequence DNA are commercially 
available. 

20 In a preferred embodiment, the isolate is a 

patient isolate which has not been passaged in culture. ^ 
It is known that following passage in T-cells, HIV 4 
isolates mutate and isolates best suited for growth 
under cell culture conditions are selected. For 

25 example, cell culture strains of HIV develop the 

ability to form syncytia. Therefore, preferably the 
amino acid sequence of gpl2 0 is determined from a 
patient isolate prior to growth in culture. Generally, 
DNA from the isolate is amplified to provide sufficient 

3 0 DNA for sequencing. The deduced amino acid sequence is 
used as the amino acid sequence of the isolate, as 
described hereinbefore. 

To determine the percentage each isolate 
constitutes of total HIV that infects individuals in 

35 the geographic region, standard epidemiological methods 
are used. In particular, sufficient isolates are 
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sequenced to ensure confidence that the percentage of 
each isolate in the geographic region has been 
determined. For example, Ichimura et al, AIDS Res. 
Hum. Retroviruses 10:263-269 (1994) describe an 
5 epidemiological study in Thailand that determined that 
there are two strains of HIV present in the region. 
HIV strains have only recently been present in Thailand 
and Thailand, therefore has the most homogenous 
population of HIV isolates known to date. The study 

10 sequenced 23 isolates from various parts of the country 
and determined that only two different amino acid 
sequences were present in the isolates. 

In contrast, HIV has been infecting individuals in 
Africa for the longest period of any geographic region. 

15 In Africa, each of the most common isolates probably 

constitutes about 5% of the population. In such cases, 
more isolates would need to be sequenced to determine 
the percentage each isolate constitutes of the 
population. Population studies for determining the 

2 0 percentage of various strains of HIV, or other viruses, 
present in a geographic region are well known and are 
described in, for example, Ou et al, Lancet 341:1171- 
1174 (1993); Ou et al, AIDS -Res. Hum. Retroviruses 
8:1471-1472 (1992); and McCutchan et al., AIDS Res. 

25 Hum. Retroviruses 8:1887-1895 (1992). 

In the United States and western Europe, probably 
about two to four different neutralizing epitopes in 
each of the V2 , V3 , and C4 domains constitute 50 to 7 0% 
of the neutralizing epitopes for each domain in the 

30 geographic region, as described more fully hereinafter. 

Selection method 

Once the amino acid seguence of neutralizing 
epitopes for strains in a region are determined, gpl2 0 
35 from an HIV strain having gpl20 that has an amino acid 
segu nc for a neutralizing epitope in the V2 or C4 
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domain which sequence is one of the most common in the 
geographic region is selected. One of the most common 
neutralizing epitope amino acid sequences means that 
the strain has an amino acid sequence for at least one 
5 neutralizing epitope that is occurs among the most 
frequently for HIV isolates in the geographic region 
and thus is present as a significant percentage of the 
population. For example, if there are three sequences 
for a neutralizing epitope that constitute 20, 30, and 
10 40 percent of the sequences for that epitope in the 

region and the remainder of the population is comprised 
by 2 to 4 other sequences, the three sequences are the 
most common. Therefore, in African countries, if each 
of several amino acid sequences constitute about 5% of 
15 the sequences for a neutralizing epitope and the 

remainder of the sequences each constitute less than 1% 
of the population, the isolates that constitute 5% of 
the population are the most common. 

Preferably, isolates having the most common amino 
20 acid sequences for a neutralizing epitope are chosen. * 
By the most common is meant that the sequences occur - 
most frequently in the geographic region. For example, 
in the United States, the MN isolate has a C4 
neutralizing epitope that comprises at least about 45% 
25 of the population. The GNE 8 isolate has a C4 

neutralizing epitope that comprises at least about 4 5% 
of the population. Thus either isolate has the most 
common C4 neutralizing epitope in the region. When 
gpl20 from each isolate is combined in a vaccine, 
30 greater than about 90% of the C4 neutralizing epitope 

sequences are present in the vaccine. In addition, the 
amino acid sequences for the V3 neutralizing epitope in 
the MN and GNE 8 isolates are substantially similar and 
comprise about 60% of the population. Therefore, those 
35 strains have the two most common neutralizing epitopes 
for the V3 domain. In the V2 region, the MN isolate 
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amino acid sequences comprises about 10% of the 
population, and the GNE 8 isolate amino acid sequences 
comprises about 60% of the population. Therefore, the 
GNEg strain has the most common neutralizing epitope for 
5 the region and the two strains together comprise the 
two most common neutralizing epitopes for the region. 
A multivalent gpl20 subunit vaccine containing the two 
isolates contains amino acid sequences for epitopes 
that constitute about 7 0% of the V2 domain, about 60% 

10 of the V3 domain, and about 9 0% of the C4 domain for 
the United States. 

In a preferred embodiment of the method, one or 
more HIV isolates having an amino acid sequence for a 
neutralizing epitope in the V2 and/or C4 domains that 

15 constitute at least about 50% of the population for a 
selected geographic region are selected. In a more 
preferred embodiment, isolates having the most common 
neutralizing epitopes in the V3 domain are also 
included in the vaccine. 

20 As is clear, once the most common amino acid 

sequences for the neutralizing epitopes in the V2, V3, 
and C4 domains are known, an isolate having a common 
epitope for each region is preferably selected. That 
is, when only two or three isolates are used for the 

25 vaccine, it is preferable to select the isolate for 

common epitopes in each region, rather than selecting 
an isolate by analysis of a single region. 

In a more preferred embodiment, gpl20 from 
isolates having epitopes that constitute at least 50% 

30 of the population for the geographic region for V2 , V3 , 
and C4 domains are present in the vaccine. More 
preferably, the isolates have epitopes that constitute 
at least 60% of the population for the geographic 
region for the three domains. Most preferably, 70% or 

35 more are included. 
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In another preferred embodiment, the entire amino 
acid sequence of the V2 and C4 domains is determined in 
the selection process. In addition to selecting common 
sequences for the neutralizing epitopes , isolates 
5 having unusual polymorphisms elsewhere in the region 
are preferably not used for the vaccine isolates. 

Vaccine preparation 

gpl20 from the selected HIV isolate(s) is used to 

10 make a subunit vaccine, preferably a multivalent 

subunit vaccine. Preparation of gpl20 for use in a 
vaccine is well known and is described hereinafter. 
With the exception of the use of the selected HIV 
isolate, the gpl20 subunit vaccine prepared in the 

15 method does not differ from gp!20 subunit vaccines of - 
the prior art. 

As with prior art gpl2 0 subunit vaccines, gpl20 at 
the desired degree of purity and at a sufficient 
concentration to induce antibody formation is mixed 

2 0 with a physiologically acceptable carrier. A 

physiologically acceptable carrier is nontoxic to a 
recipient at the dosage and concentration employed in 
the vaccine. Generally, the vaccine is formulated for 
injection, usually intramuscular or subcutaneous 

25 injection. Suitable carriers for injection include 
sterile water, but preferably are physiologic salt 
solutions, such as normal saline or buffered salt 
solutions such as phosphate buffered saline or ringer r s 
lactate. The vaccine generally contains an adjuvant. 

30 Useful adjuvants include QS21 which stimulates 
cytotoxic T-cells and alum (aluminum hydroxide 
adjuvant) . Formulations with different adjuvants which 
enhance cellular or local immunity can also be used. 
Addition excipients that can be present in the 

3 5 vaccine include low molecular weight polypeptides (less 

than about 10 residues) , proteins, amino acids, 
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carbohydrates including glucose or dextrans, chelating 
agents such as EDTA, and other excipients. 

The vaccine can also contain other HIV proteins. 
In particular, gp41 or the extracellular portion of 
5 gp41 can be present in the vaccine. Since gp41 has a 
conserved amino acid sequence, the gp41 present in the 
vaccine can be from any HIV isolate. gpl60 from an 
isolate used in the vaccine can replace gp!20 in the 
vaccine or be used together with gpl2 0 from the 
10 isolate. Alternatively, gpl60 from an isolate having a 
different neutralizing epitope than those in the 
vaccine isolates can additionally be present in the 
vaccine. 

Vaccine formulations generally include a total of 

15 about 3 00 to 600 /xg of gpl20, conveniently in about 1.0 
ml of carrier. The amount of gpl20 for any isolate 
present in the vaccine will vary depending on the 
immunogenicity of the gp!20. For example, gp!20 from 
the Thai strains, of HIV are much less immunogenic than 

20 gpl20 from the MN strain. If the two strains were to 
be used in combination, empirical titration of the 
amount of each virus would be performed to determine 
the percent of the gpl2 0 of each strain in the vaccine. 
For isolates having similar immunogenicity , 

25 approximately equal amounts of each isolate 1 s gp!20 
would be present in the vaccine. For example, in a 
preferred embodiment, the vaccine includes gpl20 from 
the MN, GNE g , and GNE I6 strains at concentrations of 
about 300 jig per strain in about 1.0 ml of carrier. 

3 0 Methods of determining the relative amount of an 

immunogenic protein in multivalent vaccines are well 
known and have been used, for example, to determine 
relative proportions of various isolates in multivalent 
polio vaccines. 

35 The vaccines of this invention are administered in 

the same manner as prior art HIV gpl2 0 subunit 
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vaccines. In particular, the vaccines are generally 
administered at 0, 1, and at 6, 8 or 12 months, 
depending on the protocol. Following the immunization 
procedure, annual or bi-annual boosts can be 
5 administered. However, during the immunization process 
and thereafter, neutralizing antibody levels can be 
assayed and the protocol adjusted accordingly. 

The vaccine is administered to uninfected 
individuals. In addition, the vaccine can be 

10 administered to seropositive individuals to augment 
immune response to the virus, as with prior art HIV 
vaccines. It is also contemplated that DNA encoding 
the strains of gpl2 0 for the vaccine can be 
administered in a suitable vehicle for expression in 

15 the host. In this way, gpl2 0 can be produced in the 
infected host, eliminating the need for repeated 
immunizations. Preparation of gpl2 0 expression 
vehicles is described hereinafter. " * 

20 Production of crpl20 

gpl2 0 in the vaccine can be produced by any 
suitable means, as with prior art HIV gpl2 0 subunit 
vaccines. Recombinantly-produced or chemically 
synthesized gpl20 is preferable to gpl20 isolated 

25 directly from HIV for safety reasons. Methods for 

recombinant production of gpl20 are described below. 

DNA Encoding GNE 8 and GNE^ crol20 
and the resultant proteins 
3 0 The present invention also provides novel DNA 

sequences encoding gpl2 0 from the GNE 8 and GNE 16 isolates 
which can be used to express gpl20 and the resultant 
gpl20 proteins. A nucleotide sequence of less than 
about 5 kilobases (Kb) , preferably less than about 3 Kb 
35 having the nucleotide sequence illustrated in Tables 1 
and 2, respectively, encodes gpl20 from the GNE 8 and 
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GNE 16 isolates. The sequences of the genes and the 
encoded proteins are shown below in Tables 1-3. In 
particular, Table 1 illustrates the nucleotide sequence 
(SEQ. ID. NO. 27) and the predicted amino acid sequence 
5 (SEQ. . ID. NO. 28) of the GNE 8 isolate of HIV. The upper 
sequence is the coding strand. The table also 
illustrates the location of each of the restriction 
sites. 
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Table 2 illustrates the nucleotide sequence and 
the predicted amino acid sequence of the GNE 16 isolate 
of HIV. The upper sequence is the coding strand. The 
table also illustrates the location of each of the 
5 restriction sites. The first four pages of the table 
are from one clone of the gene and the second three 
pages of the table are from another clone of the gene. 
The sequences of the clones differ by about 2%. (The 
nucleotide sequences are SEQ. ID. NOs. 28 and 29/ 
10 respectively. The amino acid sequences are SEQ. ID. 

NOs. 30 and 31, respectively.) It is noted that each 
of the sequences includes a stop codon. A gene 
sequence that encodes full length gpl2 0 can be made by 
repairing one of the sequences. 
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Table 3 illustrates the amino acid sequences for 
the MN, GNEg , and GNE 16 gpl2 0 proteins. The regions of 
the sequences having identical amino acid sequences are 
enclosed in boxes. 

TABLE 3 
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Nucleic acid sequences encoding gpl20 from GNE 8 and 
GNE 16 capable of expressing gpl20 can be prepared by 
conventional means. The nucleotide sequence can be 
5 synthesized . Alternatively, another HIV nucleic acid 
sequence encoding gpl2 0 can be used as a backbone and 
altered at any differing residues by site directed 
mutagenesis as described in detail in Example 1. 

In a preferred embodiment, the nucleotide sequence 

10 is present in. an expression construct containing DNA 
encoding gpl2 0 under the transcriptional and 
translational control of a promoter for expression of 
the encoded protein. The promoter can be a eukaryotic 
promoter for expression in a mammalian cell. In cases 

15 where one wishes to expand the promoter or produce 
gpl2 0 in a prokaryotic host, the promoter can be a 
prokaryotic promoter. Usually a strong promoter is 
employed to provide high level transcription and 
expression. 

20 The expression construct can be part of a vector 

capable of stable extrachromosomal maintenance in an 
appropriate cellular host or may be integrated into 
host genomes. Normally, markers are provided with the 
expression construct which allow for selection of a 

25 host containing the construct. The marker can be on 
the same or a different DNA molecule, desirably, the 
same DNA molecule. 

The expression construct can be joined to a 
replication system recognized by the intended host 

3 0 cell. Various replication systems include viral 

replication systems such as retroviruses, simian virus, 
bovine papilloma virus, or the like. In addition, the 
construct may be joined to an amplifiable gene, e.g. 
DHFR gene, so that multiple copies of the gpl2 0 DNA can 

35 be made. Introduction of the construct into the host 
will vary depending on the construct and can be 
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achieved by any convenient means- A wide variety of 
prokaryotic and eukaryotic hosts can be employed for 
expression of the proteins. 

Preferably, the gpl2 0 is expressed in mammalian 
5 cells that provide the same glycosylation and disulfide 
bonds as in native gpl20. Expression of gpl20 and 
fragments of gpl20 in mammalian cells as fusion 
proteins incorporating N-terminal sequences of Herpes 
Simplex Virus Type 1 (HSV-1) glycoprotein D (gD-1) is 

10 described in Lasky, L. A. et al., 1986 (Neutralization 
of the AIDS retrovirus by antibodies to a recombinant 
envelope glycoprotein) Science 233: 209-212 and Haffar, 
O.K. et al., 1991 (The cytoplasmic tail of HIV-1 gpl60 
contains regions that associate with cellular 

15 membranes.) Virol. 180:4 39-441, respectively. A 

preferred method for expressing gpl20 is described in 
Example 3. In the example, a heterologous signal 
sequence was used for convenient expression of the 
protein. However, the protein can also be expressed 

20 using the native signal sequence. 

An isolated, purified GNE 8 -gpl20 and GNE 16 -gpl20 
having the amino acid sequence illustrated in Tables 
1-3 can be produced by conventional methods. For 
example, the proteins can be chemically synthesized. 

25 In a preferred embodiment, the proteins are expressed 
in mammalian cells using an expression construct of 
this invention. The expressed proteins can be purified 
by conventional means. A preferred purification 
procedure is described in Example 3. 

30 

gpl20 Fragments 
The present invention also provides gp!20 
fragments that are suitable for use in inducing 
antibodies for use in serotyping or in a vaccine 
35 formulation. A truncated gpl20 sequence as used herein 
is a fragment of gpl2 0 that is free from a portion of 
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the intact. gpl2 0 sequence beginning at either the amino 
or carboxy terminus of gp!2 0. A truncated gp!2 0 
sequence of this invention is free from the C5 domain. 
The C5 domain of gpl20 is a major immunogenic site of 
5 the molecule. However, antibodies to the region do not 
neutralize virus. Therefore, elimination of this 
portion of gpl2 0 from immunogens used to induce 
antibodies for serotyping is advantageous. 

In another embodiment, the truncated gpl20 

10 sequence is additionally free from the carboxy terminus 
region through about amino acid residue 453 of the 
gpl20 V5 domain. The portion of the V5 domain 
remaining in the sequence provides a convenient 
restriction site for preparation of expression 

15 constructs. However, a truncated gpl2 0 sequence that 
is free from the entire gpl20 V5 domain is also 
suitable for use in inducing antibodies. 

In addition, portions of the carboxy terminus of 
gpl2 0 can also be eliminated from the truncated gpl2 0 

2 0 sequence. The truncated gp!20 sequence can 

additionally be free from the gpl20 signal sequence. 
The truncated gpl20 sequence can be free from the 
carboxy terminus through amino acid residue 111 of the 
gpl2 0 CI domain, eliminating most of the CI domain but 

2 5 preserving a convenient restriction site. However, the 

portion of the CI domain through the cysteine residue 
that forms a disulfide bond can additionally be 
removed, so that the truncated gp!20 sequence is free 
from the carboxy terminus through amino acid residue 
30 117 of the gpl20 CI domain. Alternatively, the 

truncated gpl2 0 sequence can be free from the amino 
terminus of gpl20 through residue 111 of the CI domain, 
preserving the V2 disulfide bond. In a preferred 
embodiment, the truncated gpl2 0 sequence is free from 

3 5 the amino terminus of gpl20 through residue 111 of the 
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Cl domain and residue 453 through the carboxy terminus 
of gp!20. 

The truncated gpl2 0 sequences can be produced by 
recombinant engineering, as described previously. 
5 Conveniently, DNA encoding the truncated gpl2 0 sequence 
is joined to a heterologous DNA sequence encoding a 
signal sequence. 

Serotypina method 

10 The present invention also provides an improved 

serotyping method for HIV strains. The method 
comprises determining the serotypes of the V2 , V3 , and 
C4 domains of gpl2 0. 

HIV isolates can be serotyped by conventional 

15 immunoassay methods employing antibodies to the 

neutralizing epitopes in the V2 , V3 , and C4 domains for 
various strains of HIV. Preparation of the antibodies 
is described hereinbefore. The antibody affinity 
required for serotyping HIV using a particular 

20 immunoassay method does not differ from that required 1 
to detect other polypeptide analytes. The antibody 
composition can be polyclonal or monoclonal, preferably 
monoclonal. 

A number of different types of immunoassays are 
25 well known using a variety of protocols and labels. 
The assay conditions and reagents may be any of a 
variety found in the prior art. The assay may be 
heterogeneous or homogeneous. Conveniently, an HIV 
isolate is adsorbed to a solid phase and detected with 
30 antibody specific for one strain of neutralizing 

epitope for each neutralizing epitope in the V2, V3, 
and C4 domain. Alternatively, supernatant or lysate 
from the cultured isolate which contains gpl20 can be 
adsorbed to the solid phase. The virus or gpl20 can be 
35 adsorbed by many well known non-specific binding 
methods. Alternatively, an anti-gpl20 antibody, 
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preferably directed to the carboxy terminus of gpl2 0 
can be used to affix gpl20 to the solid phase. A gpl20 
capture antibody and sandwich ELISA assay for gpl20 
neutralizing epitopes is described by Moore, AIDS Res. 
5 Hum. Retroviruses 9 : 209-219 (1993). Binding between 

the antibodies and sample can be determined in a number 
of ways. Complex formation can be determined by use of 
soluble antibodies specific for the anti-gpl20 
antibody. The soluble antibodies can be labeled 

10 directly or can be detected using labeled second 

antibodies specific for the species of the soluble 
antibodies. Various labels include radionuclides, 
enzymes, fluorescers, colloidal metals or the like. 
Conveniently, the anti-gp!2 0 antibodies will be labeled 

15 directly, conveniently with an enzyme. 

Alternatively, other methods for determining the 
neutralizing epitopes can be used. For example, 
fluorescent-labeled antibodies for a neutralizing 
epitope can be combined with cells infected by the 

2 0 strain of HIV to be serotyped and analyzed by 

fluorescence activated cell sorting. 

The serotype of the HIV isolate includes the 
strain of the neutralizing epitopes for the V2, V3 , and 
C4 domains. 

25 

It is understood that the application of the 
teachings of the present invention to a specific 
problem or situation will be within the capabilities of 
one having ordinary skill in the art in light of the 

3 0 teachings contained herein. Examples of the products 

of the present invention and representative processes 
for their isolation, use, and manufacture appear below, 
but should not be construed to limit the invention. 
All literature citations herein are expressly 
35 incorporated by reference. 
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EXAMPLE 1 

Identification of C4 Neutralizing Epitopes 
The following reagents and methods were used in 
the studies described herein* 

5 

gpl2 0 sequences and nomenclature. Amino acid 
residues are designated using the standard single 
letter code. The location of amino acids within the 
gpl2 0 protein is specified using the initiator 

10 methionine residue as position 1. The designation LAI 
is used to describe the virus isolate from which the 
HIV-1 BH10 , HIV- Inn,/ HIV-1 BRU , HIV-l^, HIV-l^ and 
HIV- Ihxbio substrains (molecular clones) of HIV-1 were 
obtained. The sequence of gpl2 0 from IIIB substrain of 

15 HIV-l^j is that determined by Muesing et al . (30). 

The sequence of gpl20 from MN strain of HIV-1 is 
given with reference to the MNgpl20 clone (MNqne) . The 
sequence of this clone differs by approximately 2% from 
that of the MN 19M clone described by Gurgo et al. (13). 

20 The sequences of gpl20 from the NY-5, JRcsf, 26, 2321, 
and HXB2 strains of HIV-l are those listed by Myers et 
al . (32) except where noted otherwise. The sequence of 
the Thai isolate A244 is that provided by McCutchan et ' 
al. (24). The variable (V) domains and conserved (C) 

25 domains of gpl20 are specified according to the 
nomenclature of Modrow et al . (28). 

Monoclonal antibody production and screening 
assays. Hybridomas producing monoclonal antibodies to 

30 MN-rgpl20 (recombinantly produced gpl20 from the MN 

strain of HIV) (3) were prepared and screened for CD4 
blocking activity as described previously (7, 33) . The 
binding of monoclonal antibodies to MN-rgpl20 and to 
rgpl20s from the IIIB, NY-5, Z6, Z321, JRcsf, and A244 

35 strains of HIV-1 was assessed by enzyme linked 
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immunoadsorbant assays (ELISA) as described previously 
(33) . 

Virus binding and neutralization assays. The 

5 ability of monoclonal antibodies to neutralize HIV-1 

infectivity in vitro was assessed in a colorimetric MT- 
2 cell cytotoxicity assay similar to that described 
previously (35). MT-2 cells and H9 /HTLV— III^ cells 
were obtained through the AIDS Research and Reference 

10 Reagent Program, Division of AIDS, NIAID, NIH: 

contributed by Drs. Douglas Richman and Robert Gallo, 
respectively. Briefly, serial dilutions of antibody or 
serum were prepared in 50 jzl volumes of complete and 
then 50 ^1 of a prediluted HIV-1 stock was added to 

15 each well. After incubation for 1 hr at 4°C, 100 /xl of 
a 4 x 10 5 MT-2 cell/ml suspension was added. After 
incubation of the plates for 5 days at 37 °C in 5% C0 2 , 
viable cells were measured using metabolic conversion 
of the formazan MTT dye. Each well received 20 fil of a 

20 5 mg/ml MTT solution in PBS. 

After a 4 hr incubation at 37 °C, the dye 
precipitate was dissolved by removing 100 /il of the 
cell supernatant, adding 130 jil of 10% Triton X-100 in 
acid isopropanol, then pipeting until the precipitate 

25 was dissolved. The optical density of the wells was 
determined at 540 nm. The percentage inhibition was 
calculated using the formula: 

1- (virus control-experimental) 
(virus control -medium control) 

30 

Cell surface staining of HIV— 1 infected cells with 
monoclonal antibodies. H9 cells (2 x 10 5 ) chronically 
infected with the IIIB, HXB2, HXB3 , and HX10 substrains 
of HIV-1 UI or with HIV-Imn were incubated for 3 0 min at 
3 5 room temperature with monoclonal antibodies (10 fig per 
ml) in 100 /il of RPMI 164 0 cell culture media 
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containing 1% FCS . Cells were washed and then 
incubated with 20 fig per ml of f luorescein-con jugated, 
af f inity-purif ied, goat antibody to mouse IgG (Fab') 2 
(Cappel, West Chester, PA) for 3 0 min. Cells were 
5 washed, fixed with 1% paraformaldehyde and the bound 
antibody was quantitated by flow cytometry using a 
FACSCAN (Becton-Dickenson, Fullerton, CA) . 

Fluorescence data was expressed as percentage of 
fluorescent cells compared to the fluorescence obtained 
10 with the second antibody alone. Fluorescence was 

measured as the mean intensity of the cells expressed 
as mean channel number plotted on a log scale. 

Fragmentation of the MN-rgpl2 0 gene. Fragments of 
15 the MN-rgpl2 0 gene were generated using the polymerase 
chain reaction (PCR) (17) . Briefly, forward 3 0-mer 
oligonucleotide DNA primers incorporating a Xho 1 site/ 
and reverse 3 6-mer oligonucleotide DNA primers 
containing a stop codon followed by a Xba 1 site were 

2 0 synthesized and used for the polymerase chain 

reactions. Thirty cycles of the PCR reaction were 
performed using 0.3 /ig of a plasmid containing the gene 
for gpl20 from the MN strain of HIV-1 (pRKMN. D533) and 
0.04 nM of a designated primers. The PCR reaction 
25 buffer consisted of 0.1 M Tris buffer (pH 8.4), 50 mM 
KC1, 0.2 mM 4dNTP (Pharmacia, Piscataway, NJ) , 0.15 M 
MgCl 2 and 0.5 Unit of Taq Polymerase (Perkin-Elmer 
Cetus , Norwalk, CN) and a typical PCR cycle consisted 
of a 60 second denaturation step at 94°C, followed by a 

3 0 4 5 second annealing step at 55° C, and then an 

extension step at 72° C for 45 seconds. 

Following the PCR amplification, the PCR products 
were purified by phenol and chloroform extraction, and 
then ethanol precipitated. The purified products were 
35 then digested with the restriction endonucleases Xhol 

and Xbal. The resulting PCR products were gel purified 
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using 1% agarose (SEAKEM, FMC Bioproducts, Rockland, 
ME) or 5% polyacrylamide gel electrophoresis (PAGE) and 
then isolated by electroelution. 

5 Site directed mutagenesis of the MN~rgpl20 C4 

domain. A recombinant PCR technique (15) was utilized 
to introduce single amino acid substitutions at 
selected sites into a 600 bp Bgl II fragment of 
MN-rgpl2 0 that contained the C4 domain. This method 

10 entailed the PCR amplification of overlapping regions 
of the C4 domain of gpl20 using primers that 
incorporated the desired nucleotide changes. The 
resultant PCR products were then annealed and PCR 
amplified to generate the final product. For these 

15 reactions 18-mer "outside" primers encoding the wild 

type sequence (Bgl II sites) were amplified with 36-mer 
"inside" primers that contained the alanine or glutamic 
acid residue changes. The first PCR reaction included 
IX of the Vent polymerase buffer (New England Biolabs, 

20 Beverly, MA), 0.2 mM of 4dNTP (Pharmacia, Piscataway, 
N.J.), 0.04 nM of each synthetic oligonucleotide, 0.3 
tig of linearized plasmid, pRKMN.D53 3, which contained 
the MN-rgpl2 0 gene. Thirty PCR cycles were performed 
consisting of the following sequence of steps: 45 

25 seconds of denaturation at 94 *C, 45 second of annealing 
at 55 °C and 45 seconds of extension at 72 °C. Following 
PCR amplification, the product pairs were gel purified 
using a 1% solution of low melt agarose (SeaPlaque, FMC 
Bioproducts, Rockland, ME) . 

30 The agarose containing PCR product was melted at 

65 °C and combined with the PCR product of the 
overlapping pair and equilibrated to 37 °C. Added to 
this (20 fxl) was 10 /xl of 10X Vent Polymerase buffer, 
10 jil of 2 mM 4dNTP,. 0.04 nM each of the "outside" wild 

35 type 18 mer oligonucleotides, 57 fil of H 2 0 and 1 unit of 
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Vent Polymerase. Thirty PCR cycles were performed as 
previously above* 

The resulting PCR products were purified and 
digested with the Bgl II endonuclease. The digested 
5 PCR product was then ligated into the mammalian cell 
expression vector pRKMN.D533, which had been digested 
with Bgl II allowing for the removal of a 600 bp 
fragment. Colonies containing the correct insertion 
were identified and Sequenase 2.0 supercoil sequencing 
10 was employed to check for fidelity and the 
incorporation of the desired mutation. 

Expression of gpl2 0 fragments in mammalian cells. 

Fragments of the MN and IIIB gpl20 were expressed in 

15 mammalian cells as fusion proteins incorporating 

N-terminal sequences of Herpes Simplex Virus Type 1 -- : 
(HSV-l) glycoprotein D (gD-1) as described previously ^ 
(14, 22). Briefly, isolated DNA fragments generated by ~ 
the PCR reaction were ligated into a plasmid (pRK.gD-1) 

20 designed to fuse the gpl20 fragments, in frame, to the * 
5* sequences of the glycoprotein D (gD) gene of Type 1 
Herpes Simplex Virus (gD-1) and the 3' end to 
translational stop codons. The fragment of the gD-l *•*- 
gene encoded the signal sequence and 2 5 amino acids of 

25 the mature form of HSV-l protein. To allow for 

expression in mammalian cells, chimeric genes fragments 
were cloned into the pRK5 expression plasmid (8) that 
contained a polylinker with cloning sites and 
translational stop codons located between a 

3 0 cytomegalovirus promotor and a simian virus 40 virus 
polyadenylation site. 

The resulting plasmids were transfected into the 
2 93s embryonic human kidney cell line (12) using a 
calcium phosphate technique (11) . Growth conditioned 

3 5 cell culture media was collected 48 hr after 

transf ection, and the soluble proteins were detected by 
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ELISA or by specific radioimmunoprecipitation where 
metabolically labeled proteins from cell culture 
supernatants were resolved by sodium dodecyl sulfate 
polyacrylamide gel electrophoresis (PAGE) and 
5 visualized by autoradiography as described previously 
(1, 18). 

Radioimmunoprecipitation of MN-rgpl2 0 mutants. 

Plasmids directing the expression of the MN-rgpl2 0 C4 

10 domain mutants were transfected into 293s cells as 
described above. Twenty four hours following the 
transf ection, the cells were metabolically labeled with 
[ 35 S] -labeled methionine or cysteine as described 
previously (1) . The labeled cell culture supernatants 

15 were then harvested and 0.5 ml aliquots were reacted 

with 1-5 /xg of the monoclonal antibody or with 2 fil of 
the polyclonal rabbit antisera to MN-rgpl20 and 
immunoprecipitated with Pansorbin (CalBiochem, La 
Jolla, CA) as described previously (1) . The resulting 

20 Pansorbin complex was pelleted by centrif ugation, 

washed twice with a solution containing PBS, 1% NP-40 
and 0.05% SDS and then boiled in a PAGE sample buffer 
containing 1% 2-mercaptoethanol . The processed samples 
were the analyzed by SDS-PAGE and visualized by 

25 autoradiography (1, 18) . 

Assays to measure the binding of monoclonal 
antibodies to mutagenized MN-rgpl20 polypeptides. An 

ELISA was developed to screen for reactivity of 
3 0 MN-rgpl20 fragments and mutant proteins with various 
monoclonal antibodies. In this assay, 96 well 
microtiter dishes (Maxisorp, Nunc, Roskilde, Denmark) 
were coated overnight with mouse monoclonal antibody 
(5B6) to gD-1, at a concentration of 2.0 /xg/ml in 
35 phosphate buffered saline (PBS) . The plates were 

blocked in a PBS solution containing 0.5% bovine serum 
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albumin (PBSA) and then incubated with growth 
conditioned cell culture medium from transfected cells 
expressing the recombinant gpl2 0 variants for 2 hr at 
room temperature. The plates were washed three times 
5 in PBS containing 0.05% Tween 20 and then incubated 
with the purified, HRPO-conjugated monoclonal 
antibodies. Following a 1 hr incubation, the plates 
were washed three times and developed with the 
color imetric substrate, o-phenylenediamine (Sigma, St, 

10 Louis , MO) . 

The optical densities in each well were then read 
in a microtiter plate reading spectrophotometer at 
492 nm. Each cell culture supernatant containing 
fragments or mutated rgpl2 0s was normalized for 

15 expression based on the titering of its reactivity to 

the V3 binding monoclonal antibody 1034 or to a rabbit,* 
polyclonal antisera to MN-rgpl20. Data from these r_ 
experiments were expressed as a ratio of the optical 
densities obtained with the CD4 blocking monoclonal 

20 antibodies binding to the fragments or MN-rgpl20 
mutants compared with the full length wild type 
rgpl20s. 

To normalize for different concentrations of 
MN-rgpl2 0-derived protein in the cell culture 

2 5 supernatants , the binding of the CD4 blocking 

monoclonal antibodies to each preparation was compared 
to that of an HRPO-conjugated monoclonal antibody to 
the V3 domain of MN-rgpl2 0 (1034) . Data from these 
experiments were expressed as a ratio of the optical 

3 0 densities obtained with the CD4 blocking monoclonal 

antibodies to the HRPO conjugated V3 reactive 
monoclonal antibody. 

CD 4 binding assays. The ability of monoclonal 
3 5 antibodies to inhibit the binding of MN-rgpl20 to 

recombinant soluble CD4 (rsCD4) was determined in a 
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solid phase radioimmunoassay similar to that described 
previously (33) . The effect of single amino acid 
substitutions on the binding of MN-rgpl2 0 mutants to 
CD4 was determined in a co-immunoprecipitation assay 
5 similar to that described previously (21) . Briefly, 

293 cells were metabolically labeled with 35 S-methionine 
24 hr after transfection with plasmids expressing 
MN-rgpl20 variants. Growth conditioned cell culture 
medium (0-5 ml) was then incubated with 5.0 jig of 

10 recombinant sCD4 for 90 minutes at room temperature. 
Following this incubation, 5.0 /xg of an anti-CD4 
monoclonal antibody (465) , known to bind to an epitope 
remote from the gpl2 0 binding site, was added and 
allowed to incubate another 90 minutes at room 

15 temperature. 

The gpl20-CD4 -antibody complexes were precipitated 
with Pansorbin that had been washed with PBS, 
preabsorbed with 0.1% bovine serum albumin and then 
bound with 50 /xg of an affinity purified rabbit anti- 

20 mouse IgG (Cappel, West Chester, PA) . The pellet was 
washed twice with PBS 1% NP-4 0, 0.05% SDS, and then 
boiled in beta mercaptoethanol containing SDS-PAGE 
sample buffer. The immunoprecipitation products were 
resolved by SDS PAGE and visualized by autoradiography 

25 as described previously (1,21). 

Antibody affinity measurements. Anti-gpl20 
antibodies were iodinated with Na 125 I with iodogen* 
(Pierce, Rockford, IL) . Briefly, 50 /xg of antibody in 

30 PBS was placed in 1.5 ml polypropylene microcentrifuge 
tubes coated with 50 ftg of Iodogen. Two miilicuries of 
carrier free Na[ ,25 I] was added. After 15 min. , free I25 I 
was separated from the labeled protein by 
chromatography on a PD-10 column (Pierce, Rockford, IL) 

35 pre-equi libra t d in PBS containing 0.5% gelatin. 
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Antibody concentrations following iodination were 
determined by ELISA to calculate specific activities. 

For binding assays, 96-well microtiter plates were 
coated with 100 /il/well of a 10 /xg/ml solution of 
5 MN-rgpl20 or IIIBrgpl20 in 0.1 M bicarbonate buffer, 
pH 9*6 and incubated for 2 hr at room temperature or 
overnight at 4°C. To prevent non-specific binding, 
plates were blocked for 1-2 hr at room temperature with 
200 ^tl/well of a gelatin solution consisting of PBS 

10 containing 0.5% (wt/vol) gelatin and 0.02% sodium 

azide. Unlabeled anti-gpl2 0 monoclonal antibody (0 to 
4 00 nM) was titrated (in duplicate) in situ and 
radiolabeled antibody was added to each well at a 
concentration of 0.5 nM. 

15 After a 1-2 hr incubation at room temperature, the 

plate was washed lOx with the PBS/ 0.5% gelatin/0.02% 
azide buffer to remove free antibody. The antibody- 
gpl20 complexes were solubilized with 0.1 N NaOH/0.1% 
SDS solution and counted in a gamma counter. The data 

2 0 were analyzed by the method of Scatchard (40) using the 

Ligand analytical software program (31) . K d values 
reported represent the means of four independent 
determinations. 

25 RESULTS 

Characterization of monoclonal antibodies to MN-rgpl2 0 
that block CD 4 binding. Monoclonal antibodies prepared 
from mice, immunized with MN-rgpl20 (3, 33), were 
screened for the ability to bind to MN-rgpl20 coated 

3 0 microtiter dishes by ELISA as described previously 

(33) . Of the thirty five clones obtained, seven were 
identified (1024, 1093, 1096, 1097, 1110, 1112, and 
1127) that were able to inhibit the binding of 
MN-rgpl2 0 to recombinant CD4 in ELISA (Figure 1) or 
35 solid phase or cell surface radioimmunoassays (21, 33). 
Previous studies have shown that two distinct classes 
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of CD4 blocking monoclonal antibodies occur: those that 
bind to conformation dependent (discontinuous) epitopes 
(16, 26, 33, 35, 4 5) and those that bind to 
conformation independent (sequential) epitopes (4, 7, 
5 21, 33, 43). 

To distinguish between these two alternatives, the 
binding of the monoclonal antibodies to denatured 
(reduced and carboxymethylated) MN-rgpl20 (RCM-gpl20) 
was measured by ELISA as described previously (33) . As 
10 illustrated in Table 4, below, it was found that all of 
the CD4 blocking monoclonal antibodies reacted with the 
chemically denatured protein; indicating that they all 
recognized conformation independent (sequential) 
epitopes . 

15 
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Table 4 

Properties of monoclonal antibodies to MN-rgpl2 0 





CD 4 


HIV-1 mn 






C4 


rgl20 




Inhi- 


Neutral- 


HIV-1 mn 


CM- 


Domain 


cross 


MAb 


Id itors 


i z at ion 


V3 


raol20 


pBpt ides 


react ivity 


1024 


+ 


+ 


— 


+ 


— 


2 


1093 


+ 




— 


+ 




2 


1096 


+ 


+ 




+ 




2 


1097 




+ 




+ 




2 


1110 


+ 


+ 




+ 




2 


1112 


+ 


+ 




+ 




2 


1127 


+ 


+ 




+ 




2 


1026 




+ 




+ 




1,2,3,4,6 


1092 








+ 




1,2,3,4,5 


1126 








+ 




1/2,3,5,7 


1086 








+ 




2 


13H8 


+ 








1,3 


1,2,3,4,5,6,7 



20 

rgpl20 cross reactivity: 1, IIIB-rgl20; 2, MN-rgpl20, 
3, NYS-rgpl20; 4, JrCSF-rgpl20 ; 5, Z6-rgpl20; 6, Z321- 
rgpl20; 7, A244-rgpl20 

25 

C4 domain peptides: 

1, FINMWQEVGKAMYAPPIS (SEQ. ID. NO. 24); 

2, MWQEVGKAMYAP (SEQ. ID. NO. 25) ; 

3, GKAMYAPPIKGQIR (SEQ. ID. NO. 26) 

30 

The cross reactivity of thesis monoclonal 
antibodies was assessed by ELISA as described 
previously (33) . In these experiments, the ability of 
the monoclonal antibodies to bind to a panel of seven 

35 rgpl20s, prepared from the IIIB, MN, Z6, Z321 / NY-5, 
A244, and JRcsf isolates of HIV-1, was measured by 
ELISA (33). It was found that all of the CD4 blocking 
monoclonal antibodies were strain specific and bound 
only to gpl20 from the MN strain of HIV-1 (Table 4) . 

40 However, other antibodies from the same fusion 

-56- 



BNSDOCID: <WO 9428929A1 J_> 



WO 94/28929 



PCT/US94/06036 



(1026,1092, and 1126) exhibited much broader cross 
reactivity (Table 4, Figure 2), as did a CD4 blocking 
monoclonal antibody to IIIB-rgpl20 (13H8) described 
previously (33). 
5 Further studies were performed to characterize the 

neutralizing activity of the antibodies to MN-rgpl2 0. 
In these studies, monoclonal antibodies were incubated 
with cell free virus (HIV-l^) , and the resulting 
mixture was then used to infect MT-2 cells in 

10 microtiter plates. After 5 days, the plates were 

developed by addition of the color imetric dye, MTT, and 
cell viability was measured spectrophotometrically . It 
was found (Table 4, Figure 2) that all of the CD4 
blocking monoclonal antibodies were able to inhibit 

15 viral infectivity. However the potency of the 

monoclonal antibodies varied considerably with some 
monoclonal antibodies (eg. 1024) able to inhibit 
infection at very low concentrations (IC& of 0.08 fig 
per ml) whereas other monoclonal antibodies (eg. 1112) 

2 0 required much higher concentrations (ICjo of 3 0 fig per 
ml) . In control experiments two monoclonal antibodies 
to MN-rgpl20 from the same fusion (eg. 1086, 1092) were 
ineffective, whereas the 1026 monoclonal antibody 
exhibited potent neutralizing activity. Similarly, 

25 monoclonal antibodies to the V3 domain of IIIB-rgpl20 

(10F6, 11G5) known to neutralize the infectivity HlV-lm, 
(33) , were unable to neutralize the HIV-lj^ virus. 

Binding studies using synthetic peptides were then 
performed to further localize the epitopes recognized 

30 by these monoclonal antibodies as described previously 
(33). When a peptide corresponding to the V3 domain 
(3) of MN-rgpl20 was tested, it was found that none of 
the CD4 blocking antibodies showed any reactivity. 
However the epitope recognized by the non-CD4 blocking 

35 monoclonal antibody, 1026, prepared against MN-rgpl20 
could be localized to the V3 domain by virtue of its 
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binding to this peptide. In other experiments, three 
synthetic peptides from the C4 domain of gpl2 0 that 
incorporated sequences recognized by the CD4 blocking, 
weakly neutralizing monoclonal antibodies described by 
5 McKeating et al. (26) were tested (Table 4) • It was 
found that none of the CD4 blocking monoclonal 
antibodies to MN-rgpl20 reacted with these peptides, 
however the non-neutralizing, CD4 blocking 13H8 
monoclonal antibody bound to the peptides corresponding 

10 to residues 423-440 of IIIB-gpl20 and residues 431-441 
of MN-gpl20, but not to that corresponding to residues 
426-437 of IIIB-gpl20. Thus the 13H8 monoclonal 
antibody recognized a epitope that was similar, if not 
identical, to that described by McKeating et al. (26) . 

15 This result is consistent with the observation that the 
13H8 monoclonal antibody and the monoclonal antibodies 
described by Cordell et al. (4) and McKeating et al. . 
(26) exhibited considerable cross reactivity, whereas 
the antibodies to MN-rgpl20 were highly strain 

20 specific. 

CD4 blocking antibodies recognize epitopes in the 
C4 domain. Previously, a strain specific, CD4 blocking 
monoclonal antibody (5C2) raised against IIIB-rgpl2 0 
was found to recognize an epitope in the C4 domain of 
IIIB-rgpl20 (21, 33) . Although the 5C2 monoclonal 
antibody was able to block the binding of rgpl20 to 
CD4, it was unable to neutralize HIV-1 infectivity in 
vitro (7) . Affinity columns prepared from 5C2 adsorbed 
an 11 amino acid peptide (residues 422 to 4 32) from a 
tryptic digest of gpl2 0 (21) , however monoclonal 
antibody 5C2 was unable to recognize this peptide 
coated onto wells of microtiter dishes in an ELISA 
format (Nakamura et al. , unpublished results) . 

To determine whether the CD 4 blocking monoclonal 
antibodies raised against MN-rgpl20 recognized the 
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corresponding epitope in the C4 domain of MN-rgpl20, a 
series of overlapping fragments, spanning the V4 and C4 
domains of HIV-1,^ gpl2 0, were prepared for expression 
in mammalian cells. A diagram of the fragments 
5 expressed is shown in Figures 3 A and 3B. The C4 domain 
fragments were expressed as fusion proteins that 
incorporated the signal sequence and amino terminal 25 
amino acids of HSV-1 glycoprotein D as described above. 
Plasmids directing the expression of the chimeric 

10 C4 domain fragments were transfected into 293 cells, 
and their expression was monitored by 
radioimmunoprecipitation studies where a monoclonal 
antibody, 5B6, specific for the mature amino terminus 
of glycoprotein D was utilized. It was found 

15 (Figure 3B) that all of the fragments were expressed 
and exhibited mobilities on SDS-PAGE gels appropriate 
for their size. Thus f MN. 368-408 (lane 1) exhibited a 
mobility of 19 kD; fMN. 368-451 (lane 2) exhibited a 
mobility of 29 kD; fMN. 419-4 33 (lane 3) exhibited a 

20 mobility of 6 kD, and fMN. 414-451 (lane 4) exhibited a 
mobility of 6.1 kD. 

The binding of monoclonal antibody 1024 to the 
recombinant fragments was then determined by ELISA (as 
described in Example 1) . It was found (Figure 3 A) that 

25 monoclonal antibody 1024 reacted with the fragments 

that contained the entire C4 domain of MN-rgpl2 0 (fMN 368 _ 
451 , fMN 4W-4$J ) , but failed to bind to a fragment derived 
from the adjacent V4 domain (fMN 36 ^ 408 ) or to another 
fragment that contained V4 domain sequences and the 

30 amino terminal half of the C4 domain (fMN 368 ^ 28 ) . The 

fact that 1024 bound to the fMN 4l4 ^ 5l and fMN 41M43 fragments 
demonstrated that the epitopes recognized by all of 
these monoclonal antibodies were contained entirely 
between residues 419 and 44 3 in the C4 domain. 
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Residues recognized by monoclonal antibodi s that 
block binding of MN-rgpl2 0 to CD4 • To identify 
specific amino acid residues that might be part of the 
epitopes recognized by these monoclonal antibodies, the 
5 sequence of the C4 domain of MN-rgpl2 0 was compared to 
those of the gpl20s from the six other rgpl20s that 
failed to react with the CD4 blocking monoclonal 
antibodies (Figure 4) . It was noted that the sequence 
of MN-rgpl2 0 was unique in that K occurred at position 

10 429 whereas the other rgp!20s possessed either E,G, or 
R at this position. Another difference was noted at 
position 440 where E replaced K or S. To evaluate the 
significance of these substitutions , a series of point 
mutations were introduced into the MN-rgpl20 gene 

15 (Figure 5) • Plasmids expressing the mutant proteins v 
were transfected into 293s cells, and expression was 
verified by radioimmunoprecipitation with a monoclonal 
antibody (1034) directed to the V3 domain of 
MN-rgpl2 0. Cell culture supernatants were harvested 

20 and used for the monoclonal antibody binding studies 
shown in Table 6. To verify expression, radio- 
immunoprecipitation studies using cell culture 
supernatants from cells metabolically labeled with 
[ 35 ] S-methionine were performed using the 1024 

25 monoclonal antibody specific for the C4 domain of 

MN-rgpl20 (A) or the 1034 monoclonal antibody specific 
for the V3 domain of MN-rgpl20. Immune complexes were 
precipitated with the use of fixed S. aureus and the 
adsorbed proteins were resolved by SDS-PAGE. Proteins 

30 were visualized by autoradiography. The samples were: 
Lane 1, MN.419A; lane 2 MN.421A; lane 3 MN.429E; lane 
4, MN.429A; lane 5, MN.432A; lane 6, MN.440A; lane 7, 
MN-rgpl20. The immunoprecipitation study showed that 
1024 antibody binds well to all the variants except 3 

35 and 4 which are mutated at residue 429. 1034 antibody 
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was used as a control and precipitates with anti-V3 
antibodies. 

The effect of these mutations on the binding of 
the CD4 blocking monoclonal antibodies was then 
5 evaluated by ELISA as illustrated in Table 5, below. 

Table 5 

Binding of CD4 blocking monoclonal 
antibodies to C4 domain mutants 

10 



Proteins/ 
MAbs 


1024 


1093 


1096 


1097 


1110 


1112 


1127 


5C2 


MN-rgpl20 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


0.05 


MN-419A 


1.11 


1.10 


0.94 


1.21 


0.78 


0.95 


1.10 


ND 


MN-421A 


1.11 


1.60 


0.88 


1.42 


1.34 


0.91 


1.10 


ND 


MN-429E 


0.03 


0.07 


0.11 


0.04 


0.10 


0.10 


0.02 


ND 


MN-429A 


0.10 


0.07 


0.14 


0.04 


0.09 


0.11 


0.05 


ND 


MN-432A 


0.77 


0.15 


0.59 


0.08 


0.12 


0.24 


0.26 


ND 


MN-440A 


1.06 


1.13 


1.08 


0.87 


1.12 


1.0 


1.3 


ND 


IIIB-rgpl20 


0.03 


ND 


ND 


ND 


ND 


ND 


ND 


1.0 


MN-423F 


ND 


ND 


ND 


. ND 


ND 


ND 


ND 


0.45 


MN-423F,429E 


ND 


ND 


ND 


ND 


ND 


ND 


ND 


1.09 



25 Data represent the relative binding of MAbs to the native and 
mutant forms of rgpl20. Values were calculated by dividing the 
binding (determined by ELISA) of the CD 4 blocking MAbs to the 
proteins indicated by the values obtained for the binding of a V3 
specific MAb (1034) to the same proteins (as described in 

3 0 Example 1 ) . 

It was found that replacement of KuoWith an A 
residue (MN.440A)had no effect on the binding of the 
1024 monoclonal antibody or any of the other CD4 

35 blocking monoclonal antibodies (Table 5). The 

significance of K at position 429 was then evaluated by 
substitution of either A (MN.429A) or E (MN.429E) at 
this location. It was found that the A for K 
substitution at position 429 (MN.420A) markedly reduced 

40 the binding of the 1024 monoclonal antibody and all of 
the other CD4 blocking monoclonal antibodies (Table 5) . 
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10 



Similarly, the replacement of E for K (MN.429E) at this 
position totally abrogated the binding of the 1024 
monoclonal antibody and all of the other CD4 blocking 
monoclonal antibodies (Table 5) . Several other mutants 
were constructed to evaluate the role of positively 
charged residues in the C4 domain. It was found that A 
for K substitutions at positions 419 (MN.419A) and 
421(MN.421A) failed to interfere with the binding of 
any of the CD4 blocking monoclonal antibodies as 
illustrated in Table 6, below. 



15 



20 



25 



Table 6 

Correlation Between Antibody Binding Affinity 
and Virus Neutralizing Activity 



nM* 



MAb 


Block 




0.4 


1024 e 




2.7 ± 0.9 


108 6 ef 




9.7 ± 2.2 




1093 e 


+ 


9.9 ± 2.6 


3.3 


1096 c 


+ 


10 ± 6 


12 


1097 c 


+ 


13.4 ± 3.7 


12 


1110 e 


+ 


12.1 ± 1.7 


12 


1112 e 


+ 


20 ± 4.4 


200 


1127 e 


+ 


9.3 ± 4 


3.3 


108 6 cf 




9.7 ± 2.2 




13H8 f '« 




22 ± 6 





30 



35 



Blocked binding of rgp!20 MN to CD4. - 

Blocked binding of rgpl20 Illb, not rgpl20 MN, to 

CD4. 

Mean of four determinations calculated using the 
method of Scatchard (40) . 

Neutralization of HIV-l,^ infectivity in vitro. 
Anti-rgpl2 0 MN antibody. 
Did not neutralize HIV-l infectivity. 
Anti-rgpl20 Illb antibody. 



-62- 



BNSDOCID: <WO ^9428929A1_I_> 



WO 94/28929 




PCT/US94/06036 



However, when K at position 4 32 was replaced with 
A (MN432.A), the binding of all of the CD4 blocking 
antibodies was markedly reduced (Table 5) . 
Interestingly, the binding of monoclonal antibody 1024 
5 appeared less affected by this substitution than the 
other monoclonal antibodies (Table 5) . Thus, these 
studies demonstrated that K 429 and K 432 were critical for 
the binding of all of the CD4 blocking monoclonal 
antibodies, and that K 419 , K 42I/ and K^o did not appear to 
10 play a role in monoclonal antibody binding. 

Amino acids recognized monoclonal antibodies that 
block binding of IIIB-rgpl2 0 to CD4 • The 

identification of residues 429 and 432 as being part of 

15 the epitope recognized by the MN-rgpl20 specific CD4 
blocking monoclonal antibodies was particularly 
interesting since this region was previously found to 
be implicated in the binding of the 5C2 monoclonal 
antibody (21). The properties of the 1024 like- 

20 monoclonal antibodies and the 5C2 monoclonal antibody 
differed from the C4 reactive monoclonal antibodies 
described by other investigators (4, 43) in that the 
former appeared strain specific and the latter were 
broadly cross reactive. To account for the strain 

25 specificity of these monoclonal antibodies, the 
sequence of the eleven amino acid peptide of 
IIIB-rgpl20 recognized by monoclonal antibody 5C2 was 
compared to the corresponding sequence of MN-rgpl20. 
It was found that the IIIB protein differed from the 

30 MNB protein at positions 429 where K replaced E and at 
position 423 where I replaced F (Figure 5) . Because it 
was known from previous studies (33) that the 5C2 
monoclonal antibody was unable to bind to gpl20 from 
two strains (i.e., NY-5 and JRcsf) that also possessed 

35 E at position 423, it seemed unlikely that this 

position could account for the strain specif icity of 
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5C2 . Sequence comparison (Figure 5) also showed that 
gpl20 from HIV-l^ was unique in that a phenylalanine 
residue occurred at position 423 whereas the other six 
strains examined possess an I at this position. 
5 To determine whether residues 423 and/or 429 could 

account for the type specificity of the 5C2 monoclonal 
antibody, a mutant of MN-rgpl20 was constructed which 
incorporated an F for I replacement at position 42 3 
(MN.423F). In addition, the MN-rgpl20 mutant, MN.429E 

10 (described above) was further mutagenized to 

incorporate a F for I substitution at position 42 3 
(MN.423F), thus resulting in a double mutant 
(MN. 423F, 429E) whose sequence was identical to that of 
IIIB-rgpl2 0 within the 10 amino acid 5C2 epitope 

15 (Figure 4). The expression of these mutants in 293 s-V* 
cells was verified by radioimmunoprecipitation using * 
rabbit polyclonal antisera to MN-rgpl2 0, When the 
binding of the 13H8 monoclonal antibody to a set of 
mutants incorporating substitutions at position 423 and 

20 429 was examined, it was found that none of the 

replacements effected the binding of this antibody 
(data not shown) . When the 5C2 monoclonal antibody was 
examined, it was found that the F for I replacement 
(MN.423 F) conferred partial reactivity (Table 5) • 

25 When the double mutant (MN. 423F, 429E) , containing the F 
for I substitution as well as the E for K substitution 
was tested, binding that was indistinguishable from 
that to IIIB-rgpl20 was observed (Table 5 ) . These 
results demonstrated that F at position 423 and E at 

3 0 position 429 both play a role in binding of the 5C2 
monoclonal antibody, and suggest that the strain 
specificity of 5C2 can be attributed to the residues at 
these positions. 

Examination of the sequences of gp!2 0 from the 

35 various clones of LAI that have been analyzed revealed 
that several substrains of LAI differed from each other 
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in the C4 domain. Thus the sequences of the IIIB (30) , 
Bru (46) , and HXB3 (6) clones of LAI were identical at 
positions 423 and 4 29 where F and E residues occurred 
respectively. However, the sequence of the HXB2 
5 substrain (36) differed from the others at these 

positions where, like MN-rgpl20, K replaced E and at 
position 423 where I replaced F (Figure 5) . Similarly, 
the HX10 and BH10 substrains (36, 37) differed only at 
position 423 where, like HIV-1^, I replaced F. Based 

10 on the mutagenesis experiments above, it would be 

predicted that monoclonal antibody 1024 should be able 
to bind to gpl20 from the HXB2 substrain of LAI, but 
not the HXB3 substrain. If I 423 was important for 
binding, then 1024 should also bind the HX10 substrain. 

15 To test this hypothesis, the binding of monoclonal 

antibody 1024 to the surface cells infected with either 
IIIB, HXB2, HXB3, and HX10 substrains of HIV- 1^ was 
measured by flow cytometry. It was found that 
monoclonal antibody 1024 was able to bind only HXB2 

20 providing further confirmation that residues 423 and 
4 29 were important for the binding of this antibody. 
The fact that monoclonal antibody 1024 did not bind to 
HX10 infected cells suggested that I 423 was not important 
for the binding of this monoclonal antibody . Thus 

25 these studies demonstrate that reactivity with the 1024 
monoclonal antibody segregates with the occurrence of F 
and E residues at positions 423 and 429, respectively, 
and shows that substrains of HIV-l^j differ from one 
another at a functionally significant epitope in the C4 

30 domain. 

Neutralizing activity of CD 4 blocking antibodies 
correlates with their binding affinity. To account for 
the difference in virus neutralizing activity between 
3 5 the CD 4 blocking monoclonal antibodies, their gpl20 
binding affinities were determined by competitive 
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binding of [ l25 I ] -labeled monoclonal antibody to rgpl20 
(Table 6) . Typical Scatchard analysis of data from 
these assays is shown in Figure 7 (A to C) . Linear, 
one-site binding kinetics were observed for all the 
5 monoclonal antibodies to MN-rgpl2 0, suggesting that 
only a single class of sites was recognized, and that 
there was no cooperativity between two combining sites 
of each immunoglobulin molecule. It was found 
(Figure 7A, Table 6) that monoclonal antibody 1024, 

10 which exhibited the most potent virus neutralizing 

activity (IC 50 of 0*08 fig per ml), possessed the lowest 

K d (2.7 nM) . In contrast (Figure 7C, Table 6), 

monoclonal antibody 1112, the antibody that exhibited 

the weakest virus neutralizing activity (IC 50 of 30 /xg , ^ 

15 per ml) possessed the highest K d (20 nM) . K d s for six ^ 

additional CD4 -blocking monoclonal antibodies raised & 
against MN-rgpl20 were also determined (Table 6) . It -L C 

was found that monoclonal antibodies that possessed ; 
intermediate K d s similarly possessed intermediate 

20 neutralization IC 50 values. To explore the relationship f 
between virus neutralizing activity and gpl20 binding * . % 

affinity, the data in Table 6 was plotted in several V> 3 

different ways. It was found that when the K d of the ^ * 

monoclonal antibodies was plotted as a function of the 

25 log of the IC 50 , a linear relationship was obtained .■- " 

(Figure 8). Using this analysis a correlation 
coefficient (r) of 0.97) was obtained. Thus, this 
graph demonstrates that the virus neutralizing activity 
of these monoclonal antibodies is directly proportional 

3 0 to the gpl2 0 binding affinity, and that the threshold 
for neutralization at this epitope is defined by the 
slope of the graph in Figure 8. 

A similar analysis was performed with the non- 
neutralizing CD4 blocking monoclonal antibodies to 

35 IIIB-rgpl20, 5C2 and 13H8. The binding curve for 13H8 
(Figure 7C) showed that it bound to a single class of 



BNSDOCID: <WO 9428929A1_I_> 



WO 94/28929 




PCT/US94/06036 



sites on IIIB-rgpl20 with a K d of 22 nM. The affinity 
of 5C2 could not be determined by this assay because at 
antibody concentrations greater than 5 nM, non-linear 
(reduced gpl20 binding) was observed. This effect was 
5 suggestive steric hindrance at these concentrations or 
negative cooperativity between combining sites. The 
binding affinity was also determined for the non- 
neutralizing, non-CD4 blocking monoclonal antibody to 
MN-rgpl20, 1086. The fact that this antibody exhibited 
10 a binding affinity similar (9,7 nM) to many of the 
neutralizing monoclonal antibodies but failed to 
inhibit infectivity, proves that high antibody binding 
affinity alone is not sufficient for neutralization. 

15 Effect of C4 Domain Mutants on CD4 binding. 

Finally, the CD4 binding properties of the series of x 
MN-rgpl2 0 mutants, constructed to localize the C4 
domain epitopes, were measured in a qualitative co- 
immunoprecipitation assay. In these studies the 

2 0 ability of the mutagenized MN-rgpl20 variants to co- 

immunoprecipitate CD4 was evaluated as described 
previously (21) in a qualitative co-immunoprecipitation 
assay similar to that described previously (19) ♦ 
Briefly, 293 cells, transfected with plasmids directing 
25 the expression of MN-rgpl20 variants described in 
Figure 5, were metabolically labeled with 
[ 35 S] -methionine,, and the growth conditioned cell 
culture supernatants were incubated with rsCD4 . The 
resulting rsCD4:gpl2 0 complexes were then 

3 0 immunoprecipitated by addition of the CD4 specific 

monoclonal antibody, 465 (A) or a positive control 
monoclonal antibody (1034) directed to the V3 domain of 
MN-rgpl20 (B) . The immunoprecipitated proteins were 
resolved by SDS-PAGE and visualized by autoradiography 
35 as described previously (3) . The samples were: Lane 
1, MN.419A; lane 2, MN.421A; lane 3, MN.429E; lan 4, 
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MN.429A; lane 5, MN.432A; lane 6, MN.440A; lane 7, 
MN-rgpl20. The gel showed that the mutants that block 
antibody binding do not block binding of CD4 . 
Therefore, the antibodies do not bind to the gpl20 CD4- 
5 binding contact residues. This indicates that steric 
hinderance may inhibit antibody binding, rather than 
that the antibodies bind directly to the CD4 contact 
residues to inhibit binding. 

It was found that all of the variants in which 

10 apolar A residue was substituted for the charged K or E 
residues (e.g., MN.419A, MN.421A, MN.432A, and MN.440A) 
were still able to cd-immunoprecipitate rsCD4 . 
Similarly, the replacement of E for K at position 429 
(MN.429E), the replacement of F for I at position 423 

15 (MN.423F) or the mutant which incorporated both 

mutation (MN. 423F, 429E) also showed no reduction in 
their ability to co-immunoprecipitate rsCD4. Thus, 
radical amino acid substitutions at five positions 
failed to affect the binding of gpl20 to CD4 . These 

20 results were consistent with previous studies (5, 21, 
34) where it was found that only a few of the many 
mutations that have been induced in this region 
effected CD4 binding. 

This study indicates that neutralizing epitopes in 

25 the C4 domain have now been found to be located between 
about residues 420 and 440. In addition, the critical 
residues for antibody binding are residues 429 and 432. 

EXAMPLE 2 

3 0 Identification of V2 Neutralizing Epitopes 

The procedures described in Example l were used to 
map epitopes in the V2 region of gp!20. Table 7 
illustrates the results of mutagenicity studies to map 
V2 neutralizing epitopes. In the table, the columns 

3 5 indicate the comparison of binding of the monoclonal 
antibodies with wild type (WT) gpl20 in comparison to 
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10 



various mutations of gpl2 0 using standard notation. 
For example, "G171R" indicates that the glycine (G) at 
residue 171 has been replaced by an arginine (R) . 
"172A/173A U indicates that the residues at 172 and 173 
have been replaced by alanine. The neutralizing 
monoclonal antibodies tested (MAbs) are listed in the 
rows. The numerical values in the table are the 
optical density value of an ELISA assay performed as 
described in Example 1 to measure the amount of 
antibody binding. The underlined values indicate 
significantly reduced binding, indicating the 
substituted residue is critical for binding of the 
antibody. 



15 



TABLE 7 



MAbs 



WT G171R, 172A/ E187V 187V/188S 

M174V 173A 





6E10 


1.00 


0.10 


1.28 


0.60 


0.25 




1017 


1.00 


0.70 


1.10 


0.87 


0. 04 


20 


1022 


1.00 


0.80 


1.10 


1.00 


0. 00 




1028 


1.00 


0.90 


1. 18 


1.07 


0. 04 




1029 


1.00 


0.83 


1.16 


1.01 


0.16 




1019 


1.00 


0. 13 


1.30 


0.75 


0.74 




1027 


1.00 


0.00 


1.20 


0.80 


0. 64 


25 


1025 


1.00 


0. 69 


0. 00 


0. 00 


0. 83 




1088 


1. 00 


0.73 


1. 12 


0.94 


0. 03 




13H8 


1.00 


0.77 


0. 78 


0.48 


0.65 
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TABLE 7 (continued) 







WT 


177A 


172A/173A 


188A 


183A 




MAbs 














6E10 


1.00 


0.36 


0.52 


0.64 


0.43 


5 


1017 


1. 00 


0.77 


0.77 


0.76 


0. 11 




1022 


1. 00 


0.86 


0.72 


0.14 


0. 00 




1028 


1.00 


0.93 


0.78 


0.49 


0. 04 




1029 


1.00 


0.88 


0.85 


0.53 


0. 16 




1019 


1.00 


0.16 


0.00 


0.41 


0.44 


10 


1027 


1.00 


0. 00 


0. 02 


0.41 


0.49 




1025 


1.00 


0.75 


0.0 


0.83 


0.72 




1088 


1.00 


0.77 


0.77 


0.53 


0. 00 




13H8 


1.00 


0.72 


0.72 


0.53 


0.60 



15 As illustrated in Table 7, the study demonstrated 

that there are a series of overlapping neutralizing 
epitopes from been found to be located in the V2 region 
(residues 163 through 2 00) , with most of the epitopes 
located between residues 163 and 200. In addition, the 

20 study indicates that the critical residues in the V2 

domain for antibody binding are residues 171, 173, 174, 
177, 181, 183, 187, and 188.. 

EXAMPLE 3 

25 Immunization Studies 

gpl20 from the MN,,GNE 8 , and GNE 16 strains of HIV 
was prepared by amplifying the gene from each isolate 
and cloning and expressing the gene in CHO cells as 
described in Berman et al., J . Virol. 66:4464-4469 

30 (1992) . Briefly, the gp!60 gene was amplified with two 
rounds of amplification using the following nested 
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primers according to the protocol by Kellog et al. , pp 
3 3 7-347 in PCR Protocols : a gruide to methods and 
amplification. Innis et al. (eds.) Academic Press, 
Inc. , New York. 
5 First round primers: 

AATAATAGCAATAGTTGTGTGGWCC (W is A or T) 
ATTCTTTCCCTTAYAGTAGGCCATCC (Y is T or C) 
Second round primers: 

GGGAATTCGGATCCAGAGCAGAAGACAGTGGCAATGA 

10 GTCAAGAATTCTTATAGCAAAGCCCTTTCCAA 

The primers are SEQ. ID. NOs. 31-34. Each gene is then 
digested with the restriction endonucleases Kpnl and 
Accl. The resulting fragment was subcloned into the 
Bluescript ( + ) phagemid M13 vector (Stratagene, Inc.) 

15 and sequenced by the dideoxynucleotide method (Sanger 
et al., Proc. Natl. Acad. Sci . USA 74:5463-5467 
(1977)). 

A fragment of the gpl2 0 coding region was then 
used to construct a chimeric gene for expression in 

20 mammalian cells, as described in Lasky et al., Science 
223:209-212 (1986). The 5' end was fused to a 
polylinker adjacent to a simian virus 40 (SV40) 
promoter and the 3 1 end was fused to a polylinker 
adjacent to the 3 ' untranslated sequences containing an 

25 SV40 polyadenylation signal . The expression vector 

(MN-rgpl20) was co-transf ected in CHO cells deficient 
in production of the enzyme dihydrof olate reductase, 
along with a plasmid (pSVdhfr) containing a cDNA 
encoding the selectable marker, dihydrof olate 

3 0 reductase. Cell lines expressing MN-rgpl2 0 were 
isolated as described in Lasky et al., Science 
223:209-212 (1986). The recombinant glycoprotein was 
purified from growth-conditioned cell culture medium by 
immunoaf f inity and ion exchange chromatography as 

35 described in Leonard et al., J . Biol . Chem. 265:10373- 
10382 (1990). 
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gpl20 from the GNE 8 and GNE 16 strains of HIV is 
prepared in the same manner as described for the MN 
isolate. 

MN-rgpl20 (300 jig/ injection) , GNE g -rgpl20 (300 y.g/ 
5 injection), and GNE 16 -rgpl20 ' (300 ■ ^g/ injection) are 
prepared in an aluminum hydroxide adjuvant (as 
described in Cordonnier et al. , NaturB 340:571-574 
(1989)). Six chimpanzees are injected at 0, 4, and 32 
weeks. Sera are collected and assayed for neutralizing 
10 antibody to each strain of HIV at the time of each 

immunization and three weeks thereafter. At 35 weeks, 
each of the chimpanzees has significant levels of 
neutralizing antibodies to each strain. 

At 35 weeks, the chimpanzees are randomly assigned 
15 to three groups. Each group is challenged with aboiit 
10 50% chimpanzee-infectious doses (CID 50 ) each of one" 
of the vaccine isolates. One unimmunized chimpanzee 
(control) is also injected with the same amount of 
virus as the immunized chimpanzees for each vaccine 
20 strain. 

Sera are drawn every two weeks throughout the 
study and assayed for antibodies to HIV core proteins 
and for the presence of HIV by PCR amplification and 
co-cultivation of peripheral blood mononuclear cells 
25 (PBMCs) from the chimpanzee together with activated 

human or chimpanzee PBMCs. The presence of antibodies 
to core proteins indicates the presence of viral 
infection as does the detection of amplified viral DNA 
or viral infection of co-cultivated cells. 
3 0 The presence of virus is detected by PCR and 

co-cultivation methods in each unimmunized control 
animal between weeks 2 and 4 post challenge. 
Antibodies to core proteins appear in the control 
chimpanzees at six weeks post challenge. Neither virus 
35 nor antibodies are at detectable levels in any of the 
immunized chimpanzees at one year post challenge, 
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indicating that the vaccine effectively protects the 
chimpanzees from infection from each of the challenge 
strains. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
5 

(i) APPLICANT: Berman, Phillip W. 

Nakamura, Gerald R. 

(ii) TITLE OF INVENTION: HIV Envelope Polypeptides 

10 

(iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Skjerven, Morrill, MacPherson, Franklin & 
15 Friel 

(B) STREET: 25 Metro Drive Suite 700 

(C) CITY: San Jose 

(D) STATE: California 

(E) COUNTRY: U.S.A. 
20 (F) ZIP: 95110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

2 5 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

3 0 (B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Terlizzi, Laura 

3 5 (B) REGISTRATION NUMBER: 31,3 07 

(C) REFERENCE/ DOCKET NUMBER: M-2820-1P 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (408) 283-1222 
40 (B) TELEFAX: (408) 283-1233 

(2) INFORMATION FOR SEQ ID NO : 1 : 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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Met Arg Val Lys Gly lie Arg Arg Asn Tyr Gin His Trp Trp Gly Arg 
1 5 10 15 

Gly Thr Met Leu Leu Gly Leu Leu Met lie Cys Ser Ala Thr Glu Lys 
5 20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 
35 40 45 

10 Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Ala 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

15 

Gin Glu Val Glu Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp Lys 

85 90 95 

Asn Asn Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp Asp 
20 100 105 110 

Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

2 5 Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn Thr Asn Asn Ser Thr Asp 

130 135 140 

Asn Asn Asn Ser Lys Ser Glu Gly Thr lie Lys Gly Gly Glu Met Lys 
145 150 155 160 

30 

Asn Cys Ser Phe Asn lie Thr Thr Ser lie Gly Asp Lys Met Gin Lys 

165 170 175 

Glu Tyr Ala Leu Leu Tyr Lys Leu Asp lie Glu Pro lie Asp Asn Asp 

3 5 180 185 190 -A 

Ser Thr Ser Tyr Arg Leu He Ser Cys Asn Thr Ser Val He Thr Gin 
195 200 205 

4 0 Ala Cys Pro Lys lie Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala 

210 215 220 

Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Lys Phe Ser Gly 
225 230 235 ~ 240 

45 

Lys Gly Ser Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly lie 

245 250 255 

Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu 
50 260 265 270 

Glu Glu Val Val lie Arg Ser Glu Asp Phe Thr Asp Asn Ala Lys Thr 
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275 280 285 

lie lie Val His Leu Lys Glu Ser Val Gin lie Asn Cys Thr Arg Pro 
290 295 300 

5 

Asn Tyr Asn Lys Arg Lys Arg lie His lie Gly Pro Gly Arg Ala Phe 
305 310 315 320 

Tyr Thr Thr Lys Asn lie Lys Gly Thr lie Arg Gin Ala His Cys lie 
10 325 330 335 

lie Ser Arg Ala Lys Trp Asn Asp Thr Leu Arg Gin lie Val Ser Lys 
340 345 350 

15 Leu Lys Glu Gin Phe Lys Asn Lys Thr lie Val Phe Asn Pro Ser Ser 
355 360 365 

Gly Gly Asp Pro Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu 
370 375 380 

20 

Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser lie Trp Asn Gly 
385 390 395 400 

Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn lie Thr Leu 
25 405 410 415 

Gin Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys 
420 425 430 

3 0 Ala Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys Ser Ser Asn 
435 440 445 

lie Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Asp Thr Asp Thr 
450 455 460 

35 

Asn Asp Thr Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn 
465 470 475 480 

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Thr lie Glu Pro Leu 
40 485 490 495 

Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu 
500 505 510 

(3? INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: amino acid 
5 0 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Lys Tyr Ala Leu Ala Asp Ala Ser Leu Lys Met Ala Asp Pro Asn Arg 
1 5 10 15 

Phe Arg Gly Lys Asp Leu Pro Val Leu Asp Gin Leu Leu Glu Val Pro 
20 - 25 30 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
35 40 45 

Ala Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val 
50 55 60 

Pro Thr Asp Pro Asn Pro Gin Glu Val Glu Leu Val Asn Val Thr Glu 
65 70 75 80 

Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met His Glu Asp 

85 90 95 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
100 105 110 

Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Asn Thr -Thr Asn 
115 120 125 

Thr Asn Asn Ser Thr Asp Asn Asn Asn Ser Lys Ser Glu Gly Thr lie 
130 135 140 

Lys Gly Gly Glu Met Lys Asn Cys Ser Phe Asn lie Thr Thr Ser lie 
145 150 155 160 

Gly Asp Lys Met Gin Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp lie 

165 170 175 

Glu Pro lie Asp Asn Asp Ser Thr Ser Tyr Arg Leu lie Ser- Cys Asn 
180 185 190 

Thr Ser Val lie Thr Gin Ala Cys Pro Lys lie Ser Phe Glu Pro lie 
195 200 205 

Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn 
210 215 220 

Asp Lys Lys Phe Ser Gly Lys Gly Ser Cys Lys Asn Val Ser Thr Val 
225 230 235 240 

Gin Cys Thr His Gly lie . Arg Pro Val Val Ser Thr Gin Leu Leu Leu 

245 250 255 

Asn Gly Ser Leu Ala Glu Glu Glu Val Val lie Arg Ser Glu Asp Phe 
260 265 270 
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Thr Asp Asn Ala Lys Thr lie lie Val His Leu Lys Glu Ser Val Gin 
275 280 285 

lie Asn Cys Thr Arg Pro Asn Tyr Asn Lys Arg Lys Arg lie His lie 
5 290 295 300 

Gly.Pro Gly Arg Ala-Phe Tyr Thr Thr Lys Asn lie Lys Gly Thr lie 
305 310 315 320 

10 Arg Gin Ala His Cys lie lie Ser Arg Ala Lys Trp Asn Asp Thr Leu 

325 330 335 

Arg Gin lie Val Ser Lys Leu Lys Glu Gin Phe Lys Asn Lys Thr lie 
340 345 350 

15 

Val Phe Asn Pro Ser Ser Gly Gly Asp Pro Glu lie Val Met His Ser 
355 360 365 

Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe 
20 370 375 380 

Asn Ser lie Trp Asn Gly Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser 
385 390 395 400 

2 5 Asn Asn Asn lie Thr Leu Gin Cys Lys lie Lys Gin lie lie Asn Met 

405 410 415 

Trp Gin Lys Val Gly Lys Ala Met Tyr Ala Pro Pro lie Glu Gly Gin 
420 425 430 

30 

lie Arg Cys Ser Ser Asn lie Thr Gly Leu Leu Leu Thr Arg Asp Gly 
435 440 445 

Gly Glu Asp Thr Asp Thr Asn Asp Thr Glu lie Phe Arg Pro Gly Gly 
35 450 455 460 

Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val 
465 470 475 480 

4 0 Val Thr lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg 

485 490 495 

Val Val Gin Arg Glu 
500 

45 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
50 (B) TYPE: amino acid . 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE. DESCRIPTION: SEQ ID NO : 3 : 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
5 1 5 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(2P INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

20 

Cys Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 

25 20 .25 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

40 

Met Tyr Ala Pro Pro lie Lys Gly Gin lie Arg Cys 
20 25 

( 2) INFORMATION FOR SEQ ID NO: 6 : 
45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 
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(xi) SEQUENCE DESCRIPTION: SEQ I 

Cys Arg lie Lys Gin lie lie Asn 
1 5 

5 

Met Tyr Ala Pro Pro lie Glu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 7: 
10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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i NO: 6: 

Met Trp Gin Gly Val Gly Lys Ala 
10 15 

Gin lie Asn Cys 
25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

2 0 Cys Arg lie Lys Gin lie lie Asn Arg Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

lie Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Arg lie Lys Gin lie Val Asn Met Trp Gin Arg Val Gly Gin Ala 
1 5 10 15 

4 0 Met Tyr Ala Pro Pro lie Lys Gly Val lie Lys Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 9: 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Gly Ala Gly Gin Ala 
1 5 10 15 

Met Tyr Ala Pro Pro lie Ser Gly Thr lie Asn Cys 
5 20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

20 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 11: 
25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

35 Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
1 5 10 15 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

40 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids. . 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
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15 10 15 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

2 0 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 14: 

25 (i) SEQUENCE CHARACTERISTICS.: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Gly Gly Asp Pro Glu lie Val Met His Ser Phe Asn Cys Gly Gly 
35 1 5 10 15 

Glu Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser lie Trp Asn 
20 25 30 

4 0 Gly Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn lie Thr 
35 40 45 

Leu Gin Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly 
50 55 60 

Lys Ala Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys Ser Ser 
65 70 75 80 



45 



Asn lie Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly 
50 85 90 

(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

10 Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5, ' 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

15 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Ala Val Gly Lys Ala 
1 5 10 15 

3 0 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 17: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Cys Ala lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
4 5 1 5 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(SP INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

Cys Lys lie Ala Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
10 1 5 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(2£ INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Ala Ala 
1 5 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
30 20 25 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
1 5 10 15 

45 

Met Tyr Ala Pro Pro lie Ala Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 21: 
50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Cys Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 22: 
15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



25 Cys Lys lie Lys Gin Phe lie Asn Met Trp Gin Lys Val Gly Lys Ala 
1 5 10 15 



30 



Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
3 5 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Cys Lys lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

45 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 24: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
1 5 10 15 

10 lie Ser 



(2) INFORMATION FOR SEQ ID NO: 25: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro 
25 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Lys Ala Met Tyr Ala Pro Pro lie Lys Gly Gin lie Arg 
1 5 10 

40 
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WHAT IS CLAIMED IS: 

1. A method for making an HIV gp!2 0 subunit vaccine 
for a geographic region comprising the steps of: 

5 a. determining a neutralizing epitope in the V2 

or C4 domain of gpl2 0 of HIV isolates from 
the geographic region; 

b. selecting an HIV strain having gpl2 0 which 
has a neutralizing epitope in the V2 or C4 

10 domain which is common among isolates in the 

geographic region; and 

c. making an HIV gpl2 0 subunit vaccine from the 
selected isolate. 

2. The method of Claim 1 wherein the neutralizing 
15 epitope is determined by determining the amino 

acid sequence for at least a portion of the V2 or 
C4 domain* 

3* The method of Claim 2 wherein the amino acid 

sequence is determined by sequencing DNA encoding 
20 at least a portion of the V2 or C4 domain. 

4 . The method of Claim 3 wherein a plurality of 
isolates having different amino acid sequences for 
the V2 and C4 domains are selected. 

5. The method of Claim 4 wherein a plurality of 

25 isolates having different amino acid sequences for 

the V3 domain is selected. 

6. A method for making an HIV gp!20 subunit vaccine 
for a geographic region comprising the steps of: 

a. determining neutralizing epitopes for the V2 , 
30 V3, and C4 domains of gp!20 from HIV isolates 

from the geographic region; 

b. selecting at least two HIV isolates having 
different neutralizing epitopes in the V2 , 
V3 , or C4 domain; and 

3 5 c. making an HIV gpl2 0 subunit vaccine from the 

selected isolates . 
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7. The method of Claim 6 wherein each of the selected 
isolates have one of the most common neutralizing 
epitopes. 

8. A method for making an HIV gpl20 subunit vaccine 
5 for a geographic region comprising the steps of: 

a. determining the neutralizing epitopes for HIV 
isolates and the percentage of HIV infections 
attributable to each strain present in the 
region; 

10 b. selecting at least two HIV strains which have 

the most common neutralizing epitopes in the 
V2, V3 , and C4 domains in the geographic 
region; and 

c. making an HIV gpl2 0 subunit vaccine from the 
15 selected isolates . 

9. The method of Claim 8 wherein the isolates are 
primary patient isolates. 

10. The method of Claim 8 wherein the geographic 
region is North America and the amino acid 

20 sequence of gpl20 from the HIV isolates MN and GNE 8 

are selected. 

11. The method of Claim 10 wherein the GNE l6 isolate is 
also selected. 

12. A multivalent HIV gpl2 0 subunit vaccine. 

25 13. The vaccine of Claim 12 wherein gpl20 present in 
the vaccine is from at least two HIV isolates 
which have a different neutralizing epitope in the 
V2 or C4 domain of gpl20. 

14. The vaccine of Claim 12 wherein gp!20 present in 
3 0 the vaccine is from at least two HIV isolates 

which have a different neutralizing epitope in the 
V3 domain of gpl20. 

15. The vaccine of Claim 12 wherein each isolate has a 
different common neutralizing epitope for the V2 

3 5 or C4 domains of gp!2 0. 
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20 

23. 
24. 
25 25. 

26. 
30 27. 
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The vaccine of Claim 1 wherein gpl20 present in 
the vaccine is from the MN and GNE 8 strains of HIV. 
The vaccine of Claim 1 wherein gp!20 from the GNE, 6 
strain of HIV is also present in the vaccine. 
A DNA sequence of less than 5 kilobases encoding 
gpl20 from GNE 8 and having the nucleotide sequence 
illustrated in Table 1. 

A DNA sequence of less than 5 kilobases encoding 
gpl2 0 from GNE 16 and having the nucleotide sequence 
illustrated in Table 2. 

An expression construct comprising DNA encoding 
gpl20 selected from the group consisting of 
GNE 8 -gpl20 and GNE 16 -gpl2 0 under the transcriptional 
and translational control of a heterologous 
promoter. 

The expression construct of Claim 20 wherein the 
promoter is a eukaryotic promoter- 
The expression construct of Claim 21 wherein the 
DNA encoding gpl2 0 is joined to a heterologous 
signal sequence. 

An isolated GNE 8 -gpl2 0 polypeptide having the amino 

acid sequence illustrated in Table 1. 

An isolated GNE 16 -gpl2 0 polypeptide having the 

amino acid sequence illustrated in Table 2. 

An improved serotyping method for HIV strains 

comprising determining the serotypes of the V2 , 

V3 , and C4 domains of gpl20. 

A truncated gpl2 0 sequence which sequence is free 
from the C5 domain. 

The truncated gpl2 0 sequence of Claim 26 wherein 
the sequence is additionally free from the carboxy 
terminus through amino acid residue 453 of the 
gpl20 V5 domain. 

The truncated gpl20 sequence of Claim 27 wherein 
the sequence is additionally free from the gpl20 
V5 domain. 
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29. The truncated gpl20 sequence of Claim 26 wherein 
the sequence is additionally free from the gpl20 
signal sequence. 

30. The truncated gp!2 0 sequence of Claim 29 wherein 

5 the sequence is additionally free from the carboxy 

terminus through amino acid residue 111 of the 
gpl2 0 CI domain. 

31. The truncated gpl20 sequence of Claim 29 wherein 
the sequence is additionally free from the carboxy 

10 terminus through amino acid residue 117 of the 

gpl20 CI domain. 

32. The truncated gpl20 sequence of Claim 26 wherein 
the sequence is free from the amino terminus of 
gpl20 through residue 111 of the CI domain and 

15 residue 453 through the carboxy terminus of gpl20. 

33. The truncated gp!20 sequence of Claim 26 wherein 
the sequence is produced by recombinant 
engineering. 
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